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0.0 Introduction 



These notes were written in 2007 to accompany the Mods lecture courses of Calculus of One Variable and 
Calculus of Two or More Variables. These are mainly introductory courses in ordinary and partial differential 
equations but they also include other aspects of multivariate calculus. The notes occasionally digress beyond 
the syllabus, but such sections or comments will be clearly marked with an asterisk. They also include a wide 
range of exercises which can be found at the end of each section. The syllabuses for the two courses (as of 
Michaelmas 2007) are below. (The bullet points correspond to how the material has been arranged in chapters 
within these notes.) 

Syllabus for Calculus of One Variable (6 lectures) 

• Standard integrals, integration by parts. 

• Definition of order of an ODE - example of separation of variables. 

• General linear homogeneous DDEs: integrating factor for first order linear ODEs, second solution when 
one solution known for second order linear ODEs. 

• First and second order linear ODEs with constant coefficients. General solution of linear inhomogeneous 
ODE as particular solution plus solution of homogeneous equation. Simple examples of finding particular 
integrals by guesswork. Systems of linear coupled first order ODEs. The calculation of determinants, 
eigenvalues and eigenvectors. 

Syllabus for Calculus of Two or More Variables (10 lectures) 

• Introduction to partial derivatives. 

• Chain rule, change of variable; examples to include plane polar coordinates. Examples of solving some 
simple partial differential equations (e.g. fxy = 0, yfx = xfy). 

• Jacobians for two variable systems, calculations of areas including basic examples of double integrals. 

• Gradient vector; normal to surface, directional derivative. 

• Critical points and classification using directional derivatives (non-dcgcnerate case only). 

• Laplace's and Poisson's equation, including change of variable to plane polar coordinates and circularly 
symmetric solutions. The wave equation in two variables, including derivation of general solution. [This 
chapter appears earlier in the notes.] 

Other Reading: 

• D. W. Jordan & P. Smith, Mathematical Techniques, 3rd Edition, Oxford (2002) 

• Erwin Kreyszig, Advanced Engineering Mathematics, 9th Edition, Wiley (2005). 



If you have any suggestions on how to improve these notes, or spot any errors, feel free to email the author 
at earl@maths.ox.ac.uk 
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1. STANDARD INTEGRATION TECHNIQUES 



1.1 Trigonometric Substitutions 



We begin by looking at how to approach integrals such as 



Jo V4 - 2a; - x^ Jo 



•2a; + 2 

You may recall similar but simpler integrals from A-level such as 

da; 



+ a;2' 

To determine these we used trigonometric substitutions based on the identities 

sin2 6' + cos^6' = 1, tan^ (9 + 1 = sec^ 6", 1 + cot^ 61 = csc^ 6i. 

• The principle being in each case that we would substitute for a; in 1 — a;^ or in 1 + a;'^ the 
correct trigonometric function to leave us with a single square. 

So in the first integral we could use x = sin ^ as 1 — sin^ 9 = cos^ 9 and obtain 

f dx f d{sm9) fcos9d9 f ^ . -i 

/ , = / — - = / — = d9 = 9 + const. = sm a; + const. 

J ^/^^ J Vl - sin^ e J cos9 J 

Note that x = cos 9 would have worked just as well. 

Some similar standard trigonometric integrals are 

f dx . _i 

/ , = sm a; + const. 

J 



I 
I 
I 
I 
I 



dx _-, 

tan X + const. 



In 



= ln 



(^x + \/x^ + 1^ + const. = sinh ^ x + const. 



: + \/a;2 — 1 + const. = cosh ^ a; + const. 



= — — a;^ + const. 



1 + 

da; 
dx 
xdx 
xdx 

± 1 

a; da; 1 
1 +a;2 ~ 2 

If the fifth, sixth and seventh integrals are not apparent by inspection then a substitution u ~ x'^ would help. 
For those with knowledge of the hyperbolic functions the third and fourth integrals can be done more easily 
using the identity 

cosh^ t ~ I + sinh^ t. 
For more on hyperbolic functions see Section 1.3 at the end of this chapter. 



= -\/a;2 ± 1 + const. 
ln(l + .t2) + const. 
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By completing the square (and making a substitution if desired) every integral of the form 

f Ax + B ^ ^ f Ax + B 

/ -^r^ — da; and / , da; 

J Cx^ + Dx + E J ^Cx2 + Dx + E 

can be broken down into one of the previous forms. We return now to the integrals given at the start of the 
chapter. 



Example 1 Determine 



Solution. 



Ii = f '^^ and I2 = [ 

Jo V4-2a;-a;2 Jo 



a; da; 



a;2 + 2a; + 2 ■ 



C da; 

io V4 - 2a; - x'^ 

C da; 

•^0 ^5-(l + :c) 



[completing the square] 



2 Au 



[substituting u = 1 + a;] 



/■"/^^ ^/hdv r , . . ■ 

/ = substitutmg u = vbv 

J1/V5 \/5-5t>2 L 



'1/75 \/5 - 5t>2 

r . _i -12/75 

= [sm v\^^^ 

1 2 1 1 

= sin-1 - sin-i w 0.6435 
V5 V5 



The second integral I2 is left as Exercise 8. 



1.2 Integration by Parts 



Integration by parts (IBP) can be used to tackle products of functions, but not just any product. Suppose we 
have an integral 



j f{x)g{x) dx 



in mind. In the main: 



• This will be approachable with IBP if one of these functions integrates/differentiates, per- 
haps repeatedly, to something simpler, whilst the other function differentiates/integrates to 
something of the same kind. 

Typically then / (.;;) might be a polynomial which, after differentiating enough times, will become a constant; 
g {x) on the other hand could be something like e^, sin a;, cos a;, sinhx, cosh a;, all of which are functions which 
continually integrate to something similar. This remark reflects the nature of the formula for IBP which is: 

Theorem 2 (Integration by Parts) Let F and G he functions with derivatives f and g. Then 

j F {x) g (x) dx = F{x)G{x)- j f (x) G (x) dx. 
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STANDARD INTEGRATION TECHNIQUES 



IBP takes the integral of a product and leaves us with another integral of a product — but as we commented 
above, the point is that / (a;) should be a simpler function than F (x) was whilst G (x) should be no worse a 
function than g (x) was. 

Proof. The proof is simple — we just integrate the product rule of differentiation below, and rearrange 

^{F{x)G{x))=F{x)g{x) + f{x)G{x). 



Example 3 Determine 



I 



x^sina; dx and I x^e^^ dx. 



Jo 



Solution. Clearly x^ will be the function that we need to differentiate down, and sin x is the function that will 
integrate "in-house". So we have, with two applications of IBP: 



/ 



x^ sin X dx 



In a similar fashion 



Jo 



x^e^^ dx 



x^ (— cos x) — y 2x (— cos x) dx [IBP] 
-x^cosx + /2xcosxdx [reaxranging] 

— x^ cosx + 2xsinx — 2sinxdx [IBP] 

— x^ cos X + 2x sin X — 2 (— cos x) + const. 

(2 — x^) cos X + 2x sin X + const, [rearranging] 



2 Jo JO 



„2x 

Sx^— dx [IBP] 



e2 






1 








( 


3x - 








/' 


6x— — dx 


~ 2 




4 







Jo 


4 > 




3e 


2 








1 


/.I g2i- 






- + 


6x 






L 


~ 2 










"8~_ 

















1 


e2 3 


~ 4 


+ - 


4 






16 





"8^8' 



This is by far the main use of IBP, the idea of eventually differentiating out one of the two functions. There 
are other important uses of IBP which don't quite fit into this type. These next two examples fall into the 
original class, but are a little unusual: in these cases we choose to integrate the polynomial factor instead as it 
is easier to differentiate the other factor. This is the case when we have a logarithm or an inverse trigonometric 
function as the second factor. 



Example 4 Evaluate 



J (2x-l)ln(x^ + l) dx and J (x^ - 4) tan"^ x dx. 



Solution. In both cases the second factor looks rather daunting, certainly to integrate, but each factor differ- 
entiates nicely; recall that 

d , 1 , , d _i 1 

— Inx = — and that — tan x = 5-. 

dx X dx 1 + x^ 

So if we apply IBP to the above examples then we get 

J (2x - 1) In (x^ + 1) dx = (x^ - x) In (x^ + ^) - j {x^ - x) dx, 
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and 



J (3a;^ - 4) tan ^ a; da; = (a;^ - 4x) tan ^x - J (x^ - 4x) ^2^1 
To calculate the integrals 



2x^ - 2a;2 , f - 4x ^ 

ax and / — - — — dx 



.x2 + 1 ,/ .t2 + 1 

we need to divide the denominator into the numerator and then use the previous list of standard integrals. The 
second integral is left to Exercise 9. To determine the first integral, we need to divide a;^ + 1 into 2x^ — 2x^, 
either by inspection or by long division. Note 

2x -2 

x^ + l [2p =2p 

2a:^ +2a; 
-2a;^ -2a: 

-2a;^ -2_ 

-2a; +2 



so that 
Hence 



2a;^ - 2a;^ = (2a; - 2) [x^ + l) + (-2a; + 2) . 



f 2x^ - 2a;2 , /" / ^ -2a; 2 \ 

/ o -r- da; = / 2a: - 2 + + 

J x^ + l J \ x^ + 1 x^ + lj 



dx 



= x^ — 2x — In (a;^ + l) + 2 tan ^ a; + const. 



In the same vein as this we can use IBP to integrate functions which, at first glance, don't seem to be 
products — this is done by treating a function F (x) as the product F (x) x 1. 

Example 5 Evaluate 

Jinx ax and jt.n-^xdx. 
Solution. With IBP we see (integrating the 1 and differentiating the In a;) 

j In a; da; = j 1 xlnx dx = x\nx — j x—dx = x\nx — j da; = a;lna; — a; + const. 

and similarly 

/tan~^ a; da; = /" 1 x tan~^ a; da; = a;tan~^ x~ I x — dx = a;tan~^ x— - In (l + a;^) + const. 
J J 1 + x^ 2 ^ ^ 

■ 

Sometimes both functions remain "in- house", but we eventually return to our original integrand. 
Example 6 Determine 

J e^sir 



"sin a; dx. 



Solution. Both of these functions now remain in-house, but if we apply IBP twice, integrating the and 
differentiating the sinx, then we see 

dx 



J e^ sin X dx = e^ sinx — J e^ cos x 

J e'^ (-sinx) 



e sm X — e cos x 



e^ (sinx — cosx) ~ J sinx dx. 
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We see that we have returned to our original integral, and so we can rearrange this equation to get 



/ 



sin a; da; = ^e^ (sin a; — cos a;) + const. 



When IBP needs to be applied repetitively to determine an integral it can often make sense to consider the 
general case and set up a reduction formula. For example, in order to calculate 



J CDs' Ode 

we first set 



e de, 



and we will aim to write /„ in terms of other where k < n, eventually reducing the problem to calculating 
Iq, or Ii say, which are simple integrals. Using IBP we see 

= cos^'-^esine- j (n- l)cos"-2 6'(-sin6')sin6' d6' 

= cos"-^ 6» sin ^ + (n - 1) j cos"-^ e{l- cos^ 61) dO 

In = cos"-i^sin^ + (n-l)(7„_2-/„). 

Employing the identity sin^ 9 = 1 — cos^ 9 returns the integral to ones involving powers of cos 9 only. Rearranging 
the last equation to make 7„ the subject we see 

cos"~^^sin^ n—1 

in = 1 in-2- 

n n 

With this reduction formula /„ can be rewritten in terms of simpler and simpler integrals until we are left only 
needing to calculate Iq, if n is even, or /i, if n is odd — both these integrals are easy to calculate. 

Example 7 (See also Exercise 13.) Calculate 

l7 = j cos^ 9 d9. 

Solution. Repeatedly using the reduction formula above, we see 

cos^ 9sui9 6 ^ 

cos6^sin6l 6 / cos'^ 6* sin 6* 4 



7 7 V 5 5 

cos6 6'sin6' 6cos4 6isin6i 24 / cos^ 6* sin 6^ 2, 

+ ^ + TT^ I ^ + TT^l 



7 35 35 V 3 3 

cos6 6lsin6' Gcos^^sin^ 24 cos^ 61 sin 6* 48 . ^ 

7 + 35 + 105 + Y^sm^ + const. 



Example 8 Calculate 



f 

Jo 



aj^e^^ da; 
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Solution. We already met this integral in Example 3. We can approach this in a simpler yet more general 
fashion by setting up a reduction formula. For a natural number n, let 



f 

Jo 



a;"e^^ dx. 



We can use integration by parts to show 



n 1 







^0 



6^ n 

— dx = — - -j^Jn-i if n ^ 1 



which is our reduction formula. We first note 



-f 

Jo 



Jo = / e^^ dx = 
/o 



^2x 



e^-l 



and then applying the reduction formula the calculations made in Example 3 look so much easier on the eye: 



3e2 3 1 



Some integrands may involve two variables, such as: 
Example 9 Calculate for positive integers m, n the integral 

B{m,n)= [ x™-^(l-x)""^ dx. 

Solution. Calculating either B (m, 1) or B (1, n) is easy; for example 

B(m,l) = [ x™-^ dx= -. 

Jo m 



(1.1) 



So it would seem best to find a reduction formula that moves us towards either of the integrals B (m, 1) or 
B {l,n). Using integration by parts, if n > 2 we have 



B (m, n) 



— (1-x) 
m 



-1 1 







Jo m 



X (n- 1) X (-l)(l-x)""^ dx 



= + / x™ (1 - x)"-' dx 

/o 



m 



n-1 



m 



B{m+l,n-l). 



So if n ^ 2 we can apply this to see 



n—1 

B{m,n) = B(m + l,n-l) 



m 

n — 1 n — 2 

X 



m m 
n— 1\ / n — 2 



-B(m + 2,n-2) 
1 



m, / \ rn + 1 
ft — 1 \ / n — 2 



m J \m-\- 1 

(m- l)!(n- 1)! 
(m + n — 1)! 

Equation (1.1) shows this formula also holds for n = 1. 



TO + n — 2 
1 



B(m + n- 1,1) 
1 



m + n — 2 / m + n— 1 
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1.3 Appendix: Hyperbolic Functions 



Given a real (or complex) number z then hyperbolic cosine cosh z and hyperbolic sine sinh z are defined by: 



cosh z = 



sinh^; 



2 ' 2 
In a similar fashion to the trigonometric functions the following are also defined 



tanh z = 



sech z = 



sinh 2; 

cosh z ' 
1 

cosh z ' 



The graphs of the sinh, cosh and tanh look like 



coth z = 



csch z = 



cosh 2; 

sinh 2 
1 

sinh z ' 




The graph of y = sinh x The graph of y = cosh x The graph of y = tanh x 

The functions can also easily defined by power series, which converge for all real (or complex) z. 



y2n+l 



sinh z = z-\- 



and 



cosh z = 1 + 



3! 5! (2n + l)! 2! 

In a similar manner to trigonometric functions the following identities hold: 



z 

IT 



+ 



y2n 



(2n)! 



cosh^ z ~ 1 + sinh^ z, 



sech 



tanh z = 1, coth z = 1 + csch 



Consequently as t varies over the reals (± cosht, sinht) parametrise the two branches of the hyperbola x^—y'^ = 1, 
which explains the name hyperbolic functions. 

The hyperbolic functions also satisfy the following identities 

sinh {z + w) = sinh z cosh w + cosh z sinh w, sinh 22: = 2 sinh z cosh z, 

cosh (z + w) = cosh z cosh w + sinh z sinh w, cosh 22; = 2 cosh^ 2; — 1 = 2 sinh^ 2; + 1, 

. tanh w + tanh z 2 tanh z 

tanh(2; + w) = — , tanh 22;= 5—. 

1 + tanh w tanh z 1 + tanh z 



sinh(i2) = isin2;, sin(i2) = isinh2;, 
cosh (iz) = cos z, cos (iz) = cosh z. 

The derivatives of the trigonometric functions are 

cosh z = sinh 2, sinh z = cosh z, tanh z = sech^2;. 

dz dz dz 

Their inverses are given by the following formulae: 

sinh""^ X = In + \/ x'^ + Ij for x e M; 

cosh~^ X = In ^a; + ■\/ x^ — for a; ^ 1; 

1 / 1 + a; 
tanh" X = - In I ) for |a;| < 1. 

2 \ 1 — a; 
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1.4 Exercises 



Exercise 1 Determine 



J da;, J X sec^ x dx, j 



Ax 



4 cos a; + 3 sin a; 



Exercise 2 Determine 



Exercise 3 Evaluate 



Exercise 4 Evaluate 



I a;^lna;da;, / [ 

J J 1 + y/x J smhx 

r'"^ r-. — 1 /"^ _i , 

/ 7 -7 -, / COS a;v sin a; da;, / tan x ax. 

J3 [x-l){x-2) Jo Jo 

[°° dx dx 

J2 x^x - 1 ' io ' io + 1 



Exercise 5 Evaluate, using trigonometric and/or hyperbolic substitutions, 

dx 



r dx r dx r 
A J V4^' J2 



h (a;2 - if'^ ■ 

Exercise 6 -Bj/ completing the square in the denominator, and using the substitution 

X = —— tan y — - 
3 3 

determine 

dx 



I 



3x2 + 2a; + l" 

Exercise 7 Determine 



I r 

J Va;2 + 2a; + 5' Jo 



da; 



4a;2 + 4a; + 5 ■ 

Exercise 8 By completing the square in the denominator, or otherwise, show that 



a; da; tt 1 , /5\ . 

Jo x^ - ■ - = - + ^1^^' -^-^^'^ 



+ 2a; + 2 4 2 V2 

Exercise 9 Determine 



/x"^ — 4x /" x^ 

— 7: — — da; and / — - dx. 
x^ + 1 J x-^ — 1 



Exercise 10 Let 

r-2-n 

In = I sin" X dx. 



/ sin" X 
Jo 

/or n ^ 0. f/se integration by parts to show that, for n > 2, 

/„ = (n - 1) (/„_2 - /„) 

anrf /lence set up a reduction formula giving In in terms of In-2- Show by induction, or otherwise, that 



hk = 



2k 



k I 22fe-i' 
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Exercise 11 Let 

sin a; dx ^ f cosx dx 



/■ sina; dx f 
h = / -. ; , I 

J sm a; + cos x J 



sm X + cos X 

By considering I\ + 12 and I2 — Ii find Ii and J2. generalise your method to calculate 

cosx dx 



/sin a; da; , f 

— : — , and / — 
a sin a; + cos x J at 



t sin a; + 6 cos a; 

Exercise 12 Use the substitution a; = | (1 — sin^) to show that 



Exercise 13 Recall that 2 cos 6 = e*^ + e Show that 

128 cos^ 6" = 2 cos 76* + 14 cos 56* + 42 cos 361 + 70 cos 6, 

and deduce that 



dO = sin 79 H — — sin 56 + sin 36 + ^ sin ^ + const. 
448 320 64 64 



/cos^ 

Exercise 14 Show that sinh""^ x = In (a; + Va;^ + l) /or a; e K. 
Exercise 15 Consider the integrals 

/ x" da; and / x^ dx 

where < e < 1 < R and a, /? are reaZ. _For what values of a does the first integral remain bounded as e becomes 
arbitrarily sm,oll? For what values of (3 does the second integral remain bounded as R becomes arbitrarily large? 

Exercise 16 Let t = tan ^6. Show that 

. ^ 2t ^ 1-t^ ^ 2t cW 2 

sin (f = ?r , cos (f = ?r , tan 6 = ^ and 



1+^2' 1+^2' l_i2 l + i2- 



Use the substitution t = tan ^6 to evaluate 



^/^ de 

(l + sine")^' 



Exercise 17 Let 

p7r/2 

'sin a; dx. 



/ X"i 

Jo 



Evaluate Iq and I\ . 

Show, using integration by parts, that 



(7]-\ n— 1 
-j -n(n- 1)/„_2- 



Hence evaluate I5 and Iq. 
Exercise 18 Let 

Show that 



for n ^ 2 and hence find I5. 

Given that Iq = y/Tr/2, calculate 1%. 



in= r 

Jo 



x"e dx. 



J-n — — n — -'n-2 
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Exercise 19 Show that 



I 



5 5 1 

cos^ X dx = - sin a; + — sin 32: + — sin 5x + const. 
8 48 80 

2 . 1.5 

= sin x sni" X H — sm x + const. 

3 5 



Exercise 20 Show that 



[ a;" (In a;)" dx = """^ ^^"^^ ^ / a;" (In a;)""' da;. 

J ^ ' n+1 n+lj ^ ' 



Hence find J x'^ (ln.x)^ dx. 
Exercise 21 Show that 



+ 1 = f + 1 + V2v^ (u^ + l- V2uj . 



By making the substitution x = tan ^ u"^, or otherwise, find 

fVt^xdx. 

Exercise 22 Show that 

d , -1 1 
— tann a; 



Find real numbers A, B, C, D, E, F such that 



dx 1 — x"^ 

t 

A B Cu + D Eu + F 



1 — u^' 1 — u 1 + u u^+u + l u'^ — u + 1 



for all real values u. Hence, by using the substitution x = tanh~ w^, or otherwise, determine 

J \/ tanh X dx. 

Exercise 23 For n = 0, 1, 2, . . . define 



Show that 
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2. DIFFERENTIAL EQUATIONS 



2.1 Introduction and History 



The study of difFerential equations (DEs) is as old as calculus itself and dates back to the time of Newton (1643- 
1727) and Leibniz (1646-1716). At that time most of the interest in DEs came from applications in physics and 
astronomy — one of Newton's greatest achievements, in his Principia Mathematica (1687), was to show that a 
force between a planet and the sun, which is inversely proportional to the square of the distance between them, 
would lead to an elliptical orbit. 

The study of differential equations grew as increasingly varied mathematical and physical situations led to 
many different differential equations, and as more and more sophisticated techniques were found to solve them. 
Besides in astronomy, DEs began appearing naturally in other applied areas such as fluid dynamics, heat flow, 
vibrations on strings, in determining the curve a chain between two points will make under its own weight, and 
equally in pure mathematics, in finding the shortest path between two points on a surface, the surface across 
a given boundary of smallest area (i.e. the shape of a soap film), the largest area a curve of fixed length can 
bound, etc. Still today much of applied mathematics is concerned with the solution of differential equations that 
arise from the modelling of real world situations, whether the aim is to describe financial markets or biological 
systems. 

In this first course we will be studying ordinary difFerential equations (ODEs) rather than partial 
difFerential equations (PDEs). This means that the DEs in question will involve full derivatives, such as 
dy/dx, rather than partial derivatives, such as dy/dx. The latter notation is a measure of how a function 
y changes with x whilst all other variables (which y depends on) are kept constant. We will meet partial 
derivatives in the second course (see Chapter 5 onwards). 

We give here, and solve, a simple example which involves some of the key ideas of DEs; the example here 
is the movement of a particle P under gravity in one vertical dimension. Suppose that we write h {t) for the 
height (in metres, say) of P over the ground at time t. If we ignore air resistance and assume gravity (denoted 
as g) to be constant then h satisfies the DE 

The (upward) velocity of the particle is the qiiantity dh/dt — the rate of change of distance with time. The 
rate of change of velocity with time is called acceleration and is the quantity d^/i/df^ on the LHS of the above 
equation. The acceleration here is entirely due to gravity. Note the need for a minus sign here as gravity is 
acting downwards. 

Equation (2.1) is not a difficult DE to solve; we can integrate first once, 

^ = -gt + Ki (2.2) 

and then again 

h{t)^^gt^ +Kit + K2 (2.3) 

where Ki and K2 are constants. Currently we don't know enough about the specific motion of particle P to 
be able to say anything more about these constants. Note though that whatever the values of Ki and K2 the 
graph of h against f is a parabola. 

Definition 10 An ordinary differential equation is a equation relating a function, say y, in one variable, 
say X, and finitely many of its derivatives, i.e. something that can be written in the form 



f (y ... ^\ = 

\ ' da; ' dx^ ' ' da;*' / 
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for some function f and some natural number k. Here x is the independent variable and the DE governs how 
the dependent variable y varies with x. The equation may have no or many functions y (x) which satisfy it; 
the problem usually is to find the most general form of solution y{x), a function which satisfies the differential 
equation. 

Definition 11 The derivative d'^y/dx'^ is said to be of order k and we say that a DE has order k if it involves 
derivatives of order k and less. 

Example 12 The following are types and famous examples of ordinary differential equations. 

• A first order differential equation is one of the form 

• A A;th order inhomogeneous linecir differential equation is one of the form 

, , d'^y , , d'^^^y / \ . ^ ^ , 

^""^ d^ ^ d^ +---+ai{x) — +ao{x)y = f {x) , 

and the equation is called homogeneous if / {x) = 0. 

• The differential equation 



da; 

uniquely characterises the function y = e^. 
• The equation for simple harmonic motion is 



^=2/> y(0) = l 



dt^=-"^ 

and the DE governing the swinging of a pendulum is 

d'e g 

dt^=-r^"^- 

• Bessel's equation, which relates to vibrations in a circular membrane, is 

ax^ da; ^ ' 

• Legendre's equation, which relates to Laplace's equation, which we will meet in the second course, 
is 



We return now to equation (2.1), which is a second order DE. In some loose sense, solving a DE of order 
k involves integrating k times, though not usually in such an obvious fashion as in the case of (2.1). So we 
would expect the solution of an order k DE to have k undetermined constants in it, and this will be the case in 
most of the simple examples that we look at here. However, this is not generally the case and we will see other 
examples where more, or fewer, than k constants are present in the solution. 

The expression (2.3) is the general solution for the solution of the DE (2.1), that is an expression which 
encompasses by means of indeterminates, Ki and K2, all solutions of the DE. At the moment then our solution 

h it) = ^gt" + Kit + K2 

is not unique, but rather depends on two undetermined constants. And this isn't unreasonable as the particle P 
could follow many a path; at the moment we don't have enough information to characterise the path uniquely. 
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One way of filling in the missing info would be to say how high P was at t = and how fast it was going at 
that point. For example, suppose P started at a height of 100m and we threw it up into the air at a speed of 
10ms~^ — that is 

h (0) = 100 and ^ (0) = 10. (2.4) 
Putting these values into equations (2.2) and (2.3) we'd get 

10 = h'{0) = -gxO + Ki giving Ki = 10, 

and 

100 = /i (0) = xO^ + K1XO + K2 giving K2 = 100. 
So the height of P at time t has been uniquely determined and is given by 

h{t) = 100+ lOf- ^gt^. 

The extra bits of information given in equation (2.4) are called initial conditions as they relate to the situation 
at t = — the DE (2.1) with the initial conditions (2.4) is called an initial- value problem. 

Alternatively, suppose we were told that P was thrown at time t = from a height of 100m and was 
subsequently one second later caught at 105m. That is 

/i(0) = 100 and /i(l) = 105. (2.5) 

Putting these values into the general solution (2.3) gives us 

K2 = 100, 

:^g + K,+K2 = 105^/fi=5+|. 



Hence 



h{t) = (5+|)t+100. 



Again we have uniquely characterised the trajectory of P by saying where the particle is at two times. The 
conditions (2.5) are called boundary conditions and the DE (2.1) with the boundary conditions (2.5) is called 
a boundary-value problem. 

Having solved the earlier initial-value problem and found an equation for h, namely 

h (t) = 100 + Wt - ^gt'^ 
then we could easily answer other questions about P's behaviour such as: 

• what is the greatest height P achieves? The maximum height will be a stationary value for h (t) and so 
we need to solve the equation h' (0) — 0, which has solution t = 10/g. At this time the height is 



Mio/.) = ioo + H_^ = ioo + ^. 

9 W 9 



what time does P hit the ground? To solve this we see that 

= h{t) = 100 + lot - ^gt^ 

has solutions 



^_ -10± V100 + 2005 
-9 

One of these times is meaningless (being negative, and so before our experiment began) and so we take 
the other (positive) solution and see that P hits the ground at time 

^^lO + W^/T+Jg 
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The next example is designed to show that we should not be cavalier when solving DEs. 



Example 13 Find the general solution of the DE 



dy 
dx 



Solution. Given this equation we might argue as follows — taking square roots we get 



dy 
dx 



2W 



1 dy 
^s/ydx 



(2.6) 



(2.7) 



Prom the chain rule we recognise the LHS as the derivative of ^ and so, integrating wrt x, we have ^ = x+K, 
where K is a constant. Squaring this, we might think that the general solution has the form y = (x + K)^ . 

What, if anything, could have gone wrong with this argument? We could have been more careful to include 
positive and negative square roots at the (2.7) stage, but actually we don't lose any solutions by this oversight. 
Thinking a little more, we might realise that we have missed the most obvious of solutions: the zero function, 
y = 0, which isn't present in our 'general' solution. At this point we might scold ourselves for dividing by zero 
at stage (2.7), rather than treating y = as a separate case. But we have lost many more than just one solution 
at this point here by being careless. The general solution of (2.6) is in fact 




X ^ a 
X ^ b 
X 



where a and b are constants satisfying — oo ^ a ^ 6 ^ oo. We missed whole families of solutions by being 
careless — note also that the general solution requires two constants in its description even though the DE in 
(2.6) is only first order. 




-2-1 12 3 

A missed solution with a = 0, b = 1. 



2.2 Graphical Considerations 



Consider the first order differential equation 



dy 
da; 



= x-y. 



By the end of §3.1 we will be able to solve such equations using integrating factors. The general solution is of 
the form y = x — l + Ae^^ . But even without this knowledge we can perform some qualitative analysis on what 
the solutions might look like. 



In the first diagram below is a plot of the direction field of the DE — that is at each point {x, y) in the diagram 
is drawn a short vector (here of unit length) whose gradient equals x — y. Any solution of the DE is a function 
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X y{x) whose graph is tangential at each point to an arrow. 





/ / / / 
/ / / / 
I I f f / ^ 



The direction field is plotted for — 3 < a;, y < 3 and the solution y — x — 1 -\- e^^ is represented in bold. Even 
though we are not currently able to solve the DE we can see that solutions typically remain on one side of the 
line y = X — 1 and tend towards this line as x increases. 
The second direction field is for the DE 

dy —X 

dx ?y + 1 

which we will solve in the next section. The solution that is plotted is y = — 1 — V4 — a;^, which is valid for 
—2<x<2. This is the solution to DE with initial condition y (0) — —3, as we will see. Again, even though 
we have not yet solved the DE, the solutions' semi-circular nature is clear and also that the solutions stay on 
either side of the line y = —1. As each solution approaches the line y = — 1 we see that there is nowhere further 
that we can extend the solution; that is, there is no function y of x with a domain greater than —2<x<2 
which satisfies the DE and initial condition. 



2.3 Separable Equations 



Recall that a first order differential equation has the general form 

A special class of first order differential equation contains those of the form 

^=aix)b{y) 

and these are known as separable equations. Such equations can, in principle, be easily rearranged and solved 
as follows: 

and then integrating with respect to x, we find 

/J* =/«(.) dx. (Z9, 

Remark 14 In some texts, or previously at A-level, equation (2.8) may be written 

dy 



Hy) 



a{x) dx (2.10) 



and subsequently integrated. At first glance, though, this makes no rigorous sense, dy/dx exists as the limit 
of an approximating gradient Sy/6x whereas in the limit 6x and 6y are both 0. So (2.10) seems to say nothing 
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more than = 0. However the equation can be made rigorous with knowledge of differentials (which are beyond 
the scope of this course) and no error will result from an equation like (2.10), but we should recognise, with our 
current definition of dy/dx, that the equation has no rigorous meaning. 

Remark 15 Equation (2.9) may offer some practical problems as a solution to the differential equation. Firstly 
the functions 1/6 and a may not be readily integrable, if at all. But further, even if they both are nicely 
integrable, to end with a solution of the form 

B{y) = A (x) + const. 

is this really solving the differential equation as asked? We originally said a solution was a function y (x) which 
satisfies the differential equation, and our solution is not in this form, and may be difficult to put into this form. 

Example 16 (Exponential Growth and Decay) In many examples from science the rate of change of a variable is 
proportional to the variable itself (e.g. growth in a bacterial sample, a capacitor discharging, radioactive decay). 
That the rate of change with time t of a quantity, say y, is proportional to y is encoded in the differential 
equation 

where k is a constant. If we separate the variables we find 

-t = ^ 

ydt 

and integrating with respect to t we have, for some constant C, 

Iny = kt + C ^ y = Ae''* (2.11) 

where A is another constant. This is the general solution of the given differential equation, where A is any real 

number. Note that we can characterise our solution uniquely with an initial condition. In fact the exponential 
function y {t) — e* is characterised as the one solution of the initial value problem 

^ = y, .(0) = i. 

Remark 17 This presentation of the solution is rather "A-level". Two points have been overlooked. Firstly 
what if y = 0? Well, we can deal with the y = case by treating it as a separate case. Secondly though is 
to note that y = Ae*'* is a solution when A < yet here A = e'-' seems always to be positive. And for the 
remaining argument we should really write 

=A;=>ln|y| = kx + C 

V 

as In is an indefinite integral of the LHS whether or not y is positive. Then 

|y| = e'^e'^^ ^y = ie^e*^^ 

gives the correct general solution. 

Example 18 Find the general solution to the separable differential equation 

s.mx^=y\ny. (2.12) 



Solution. Separating variables we find 
and integrating with respect to x we get 

which rearranges to 



1 dy 1 



y In y da; sin x 



I X I 

In |lny| = In tan — + C, 



y = exp{Atan(|)} (2.13) 

where A is another constant. The solution is valid in the range — tt < a; < tt (or any similar domain between 
discontinuities of tan.). ■ 
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Example 19 Find the solution of the initial value problem 

= — -T' 2/(o) = -3- 
da; y + l 

Solution. The equation is separable and so we may rearrange it to 



(2.14) 



and integrate with respect to x to find 



As y = — 3 when a; = then C = 2 and we have 

x'' + iy + lf = 4, 



(2.15) 



which is the equation of a circle. By definition, though, a solution of a differential equation is a function y of 

X whereas for each x in the range —2 < a; < 2 there are two y which satisfy (2.15). But any solution y{x) of 
(2.14) must satisfy (2.15), so that we can solve for y obtaining 



y 



= -1- V4^ 



Note we need to take the negative root as y (0) = —3. Note also that the solution is valid in the range — 2 < a; < 2. 



Example 20 Find the solution of the initial value problem 
Solution. We can straight away integrate both sides wrt x and obtain 



y - 

Putting a; = and y = 1 we see that C = and so 



■y = x + C. 



y -y = x. 



(2.16) 



This equation does not give us a simple expression for the solution y in terms of x, but the solution y has to 

satisfy equation (2.16). 

If we sketch the curve y^ — y = x (see below) we see that the solution y, which goes through the point (0, 1) , 
can be extended arbitrarily to greater x but can only be extended down to ^^^) where dy/dx becomes 
infinite — there are no values we can assign to y for a; < without having a discontinuity in the solution y. 

On the other hand, we cannot solve cqiiation (2.16) exactly for given x and it remains our only way to 
describe the solution. It should be appreciated though that the whole curve is not a solution but just the part 
which appears in bold on the graph below. 



1.5 
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2.4 Equations Adjustable to Separable Equations 



By a homogeneous polar differential equation we will mean one of the form 

dx \x/ 



(2.17) 



Note these DEs are often simply called homogeneous, but we will use the term homoge- 
neous polar here to distinguish them from other DEs we will meet later and refer to as 
homogeneous. 



These can be solved with a substitution of the form 

y{x)=v (x) X 



(2.18) 



to get an equation in terms of x and the new dependent variable v. Note from the product rule of differentiation 
that 

dy dv 

— = v+x— 
dx dx 

and so making the substitution (2.18) into the DE (2.17) gives us the new DE 

dv 



dx 



which is a separable DE. 

Example 21 Find the general solution of the DE 

dy X + 4y 



dx 2x + 3y 

Solution. At first glance this may not look like a homogeneous polar DE, but we can see this to be the case 
by rewriting the RHS as 

1 + 4^ 



2 + 3^^' 

If we make the substitution y {x) — xv {x) then we have 

dv l + 4v 



v + x 



dx 2 + 3v' 



Rearranging this gives 



dv l + Av l + 2v - 3t;2 

x-r- = V = 



dx 2 + 3v 



and so, separating the variables, we find 



j 1 + 2V-3V- = / 



2 + 3v 
da; 



Using partial fractions gives 



5 3 
+ 



l-v l + Zv 

and resubstituting v = yjx leads us to the general solution 



dv = In |a;| + C, 





.1. 


. 3y 
1 + — 


= In \x\ + C. 


X 




X 





This can be simplified somewhat to 
where A is a constant. ■ 



x + 3y = A{x -y) 
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Example 22 Solve the initial value problem 

dy y 



dx X + y + 2' 



y{0) = l. (2.19) 



Solution. This DE is not homogeneous polar, but it can easily be made into such a DE with a suitable change 
of variables. We introduce new variables 

X = x + a and Y = y + b, 

with the aim to choose a and b so that the DE becomes homogeneous polar. If we make these substitutions 
then the RHS equals 

Y-b 
X + Y + 2-a-b 

which is homogeneous polar if 6 = and 2 — a — 6 = i.e. a = 2. With these values of a and b, noting further 
that dY/dX = dy/dx and that the initial condition has become 1 = y{x = 0) = Y (x = 0) = Y {X = 2) , our 
initial value problem now reads as 

Substituting in y = VX gives us 
Rearranging and separating the variables gives 

i-iniyi=/r-i,-ii dv=^^-^=^.\xuK. 



V J \ V ) J X 

Substituting in our initial condition, V {2} = 1/2, we see that K = 2 and so 

:^-lny = lnX + 2, X,V>Q 
becomes, when we remember V = Y/ X, and with some rearranging 

x-yiny = 2y x,y>o. 

Further, ss X = x + 2 and Y = y, our solution to the initial value problem (2.19) has become 

x + 2 = 2y + y\ny x > -2,y > 0. 



Example 23 By means of a substitution transform the following into a separable equation and find its general 
solution: 

— = cos (x + y) . 

dx 

Solution. This is neither separable, nor homogeneous polar, but the substitution 

z = x + y 

would seem a sensible one to simplify the RHS. We might then hope to get a separable equation in x and z. As 
y = z — X then 

d^ 

1 = cos z. 

dx 

This is separable and we find 

x + C. 



r dz 

J cos z + 1 
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Integrating the LHS is made easier with the trigonometric identity cos 2; + 1 = 2cos^ (^/2) , giving 

tan ^ = \ [ se(? dz = x + C. 
2 2 J \2) 



tan ( ^— ^ \=x + C 



So 



and rearranging gives 

y = 2tan~^ [x + C) — x, x G . 

as the general solution. ■ 



2.5 Exercises 



Exercise 24 Which of these solutions in (2.13) satisfy the initial condition y (0) = 1? How many solutions 
satisfy the initial condition y (0) = 2? Why are these answers unsurprising when we look at the original 
differential equation (2.12)? 

Exercise 25 Find the solutions of the initial-value problems: 

^ = xe^, y{0) = Q; ^ = -{x + l)y\ y(0) = ^. 



Exercise 26 Find the general solutions of the following separable differential equations. 



dy x^ dy cos^ x dy 



e 



da; y ' dx cos^ 2y ' dx 
Exercise 27 Find all solutions of the following separable differential equations: 

dy y — xy dy sin~^ x 

dx xy-x' dx y^i/l - ' 

Exercise 28 Let x {t) be the solution of the initial-value problem 

J = (l + x)e^ x(0) = 0, 
and let T be the value oft for which x (T) = 1. Express T as an integral and show that 

l-h<'^<'--e 

Exercise 29 By making a substitution z = dy/dx, or otherwise, solve the initial-value problem 



S^^^ + ^'^'KS' ^herey {1) = and y' (1) - ^ 



Exercise 30 Find the solution of the following initial-value problems. On separate axes sketch the solution to 
each problem. 

dy l-2x dy x {x^ + 1) -1 dy 1 + y' 
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Exercise 31 Consider the initial-value problem 



w/iere k is a positive constant. 

(a) By introducing the change of variable y = fc^sin^ (^/2), or otherwise, show that 

X {6) ='^{9 -sine), j/(^) = y(l-cos^). 

(b) Sketch the curve for 0^0^ 27r. Show that, in this range, the curve has arc-length ik^ and bounds (with 
the X-axis) a region with area 3nk'^/4. 



Exercise 32 Find two differentiable functions y : K ^ M which solve the initial value problem 

= \y\, y(o) = i. 



dy 
dx 



Exercise 33 Verify that 

^ln2 
i2<x 



yU) = l forxi 
y^^' \ l-2e-^ for In: 

is a solution of the initial value problem 



Sketch a graph of the solution. 

Exercise 34 Find the solution of the initial-value problem 

d?y 1 



da;2 1 + 



y{Q) = y' (0) = 0, 



and confirm that y (1) = J — | ln2. 



Exercise 35 By making the substitution y {x) = xv {x) in the following homogeneous polar equations, convert 
them into separable differential equations involving v and x, which you should then solve. 

— = ; X— =y+Va;2+r• 
da; xy dx 

Exercise 36 Solve the initial-value problems 

dy ^ 23;?;^ + x :„..dt/ 

da; x'^y — y 

Exercise 37 Solve the equation 



, y(^V2^—0; sin X sin t/^ = cos a; cosy, ^ ("^) ~ ^' 



2^y , 2 
a:-+a;y = y. 

Exercise 38 By introducing the new dependent variable z = y"^ solve the equation 

{y'-3x^)f^+xy = 0. 

Exercise 39 Make substitutions of the form x = X + a, y = Y-\-b, to turn the differential equation 

dy _ a; + y - 3 

da; a; — y — 1 

into a homogeneous polar differential equation in X and Y . Hence find the general solution of the above equation. 
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Exercise 40 Show that the differential equation 

dy x + y -1 



dx 2x + 2y-l 

cannot be transformed into a homogeneous polar differential equation by means of substitutions x = X + a, 
y = Y + b. 

By means of the substitution z = x + y find the general solution of the equation. 
Exercise 41 A particle P moves in the xy-plane. Its co-ordinates x {t) and y [t) satisfy the equations 

dy , da; 

— = x + y and — = x — y, 

dt ^ dt 

and at time t = the particle is at (1, 0) . Find, and solve, a homogeneous polar equation relating x and y. 
By changing to polar co-ordinates, or otherwise, sketch the particle 's journey for 0. 

Exercise 42 Find the general solutions of the differential equations 

^ = sin2 {x-2y); x^+y = e^^. 
dx dx 

Exercise 43 Find the general solutions of the differential equations 

sin^ X- — h (sin^ x + (x + y) sin 2a;) =0; — = a; + — 1. 

da; ^ V / / 

Exercise 44 Find the general solutions of the differential equations 

Exercise 45 Find the solutions of the initial-value problems consisting of the differential equation 

and the initial conditions (a) y{0) = 2, (bj y{0) = \, (c) y{0) = —2. Sketch these three solutions on the same 
axes. 

Exercise 46 If u = 1 + tany calculate d (Inn) /dy. Hence find the general solution of 

dy 



^ tan X cos y (cos y + sin y) . 



Exercise 47 Find a complete solution of 
where A is a positive constant. 

Exercise 48 * Write the left hand side of the differential equation 

{2x + y) + {x + 2y)^= 0, 

in the form 

where F y) is a polynomial in x and y. Hence find the general solution of the equation. 
Use this method to find the general solution of 



{ycosx + 2xe^) + (sina; + x^e^ — l) ^ = 0. 
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3. LINEAR DIFFERENTIAL EQUATIONS 



An important class of differential equations consists of the linear differential equations. These are important 

because of their theory, })cx-aiisc a great many important ODEs arc linear, and also because a linear differential 
equation can sometimes be used to approximate a non-linear equation. 

Definition 24 An inhomogeneous linear DE of order k is one of the form 

d^y d.^~^y dy 

d^ ^ d^ + ---+ai{x) — + ao{x)y = f (x) 

where ak {x) ^0. If f {x) = then the DE is called homogeneous linear. 



3.1 Integrating Factors 



A first order homogeneous linear differential equation is one of the form 

P{x)^+Q{x)y = 

and we can see that this DE is also separable — we have already tackled such DEs. A first order inhomogeneous 
linear DE is of the form 

P{x)^ + Q{x)y = R{x). (3.f) 

and these can be approached by rising integrating factors. 

• The idea is to multiply the LHS of (3.1) by a function I {x) of x which will make it the 
derivative of a product of the form A (x) y. 

Consider the first order inhomogeneous linear DE (3.1). In general, the LHS of (3.1) won't be expressible as 
the derivative of a product A (x) y. However, if we multiply both sides of the DE by an appropriate Integrating 

Factor / {x) then we can turn the LHS into the derivative such a product. Let's first of all simplify the equation 
by dividing through by P {x) , and then multiply by an integrating factor / (x) (which we have yet to determine) 
to get 

We would like the LHS to be the derivative of a product A {x) y for some function A (x) ; from the product rule 
A (x) y differentiates to 

A{x)^+A'{x)y. (3.3) 
So, equating the coefficients of y and y' in (3.2) and (3.3), we have 

I{x)Q{x) 



P{x) 

Rearranging this gives 



A{x)=I {x) and A' (x) = 

I'jx) ^ Q{x) 
I{x) P{x)' 
The LHS is the derivative of In / (x) and so we see 

nx) = e.pj^^dx. 
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(We are only looking for one such I (x) with this property; we do not need to worry about the constant of 
integration.) For this I {x) (3.2) now reads as 

dx^^'y^ P{x) 

which has the general solution 



1 f f I(x)R(x) ^ \ 



Example 25 Find the general solution of the DE 



dy , 2 
X- — \- (x-l)y = X . 
ax 



Solution. If we divide through by x we get 



and we see that the integrating factor is 



I (x) = exp / (l — — I da; = exp (x — Ina;) = — e^. 

J \ xj X 



Multiplying through by the integrating factor gives 



a; da; \x x"^ I 



which, by design, rearranges to 

d /I 



d^ [x'^y 1 = 
Integrating gives 

le^y = e''+K 

X 

where iif is a constant, and rearranging gives 

y{x)=x + Kxe~^ 

as our general solution. ■ 

Example 26 Solve the initial value problem 

^ + 2xy^l, y(0)=0. 
ax 

Solution. The integrating factor here is 

/ (a;) = exp J 2x dx = exp (a;^) . 

Multiplying through we get 

Noting that y (0) = 0, when we integrate this we arrive at 

e^\= fe*' dt, 
Jo 

and rearranging gives 

2/(a;) = e-^' / e*' dt. 
Jo 

The integral of e^^ can't be expressed in a closed form involving elementary functions (hence the need for normal 
distribution tables etc.) and we have to leave the answer in the given form. ■ 
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Example 27 Solve the initial value problem 

y^+smx = y'^, y{0) = l. 

Solution. This DE is neither hnear nor separable. However if we note that 

^ l_d_ , 2\ 

then we see that the substitution z = y^ turns the given DE into 

dz 

— -2z = -2smx, z(0) = l'^ = l 
dx 

which is solvable by integrating factors. In this case the integrating factor is e~^^ and we get 

(2;e"^^) = -2e"^^sina;. 
da; ^ ' 

Integrating the RHS by parts (in a similar fashion to Example 6) we get 

g-2x 

^e~^^ = — — (4 sin a; + 2 cos x) + C. 
5 

As z = 1 when a; = then C = 3/5 and so, recalling that z = y^, we have 



y = 



4 sin a; + 2 cos x + Se^^ 



being sure to take the positive root as y (0) = 1 > 0. The solution is valid on the interval containing for which 
4sina; + 2cosa; + 3e^^ > 0. ■ 



3.2 Second Order Homogeneous Linear Differential Equations 



A second order homogeneous linear differential equation is of the form 

Pi^)^ + Qi^)^ + Ri^)y = o- (3-4) 

We shall first consider the situation where, either by inspection or other means, we already know of a solution 
Y{x). 

• The idea is to make the substitution 

y{x) = Y (x) z (x) 

which transforms equation (3.4) into a first order DE involving z and x. 

Note 

dy_^ck dV ^_y'^ 2 — — — 

dx dx dx ' dx^ dx^ dx dx dx'^ 

Substituting these expressions into (3.4) we find 

Afz ^dY dz d^Y \ ^(,Az dY 



dx^ ^ da; da; ^ dx^ ^ 1 + Q I dx'^ dx^ RYz 
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which rearranges to 



Now the bracket 



d^Y ^dY 



Q— + RY 



dx^ dx 

equals zero as we know that y is a solution to (3.4). Further if we let w {x) = dz/dx then we can see (3.5) is a 
first order separable DE in w : 

PY^+(^P^+Qy)^^Q (3-6) 



da; \ dx 

which is a separable equation in solvable to find w, which we may integrate to find z and so y. 
Example 28 Show that u{x) = 1/x is a solution of 

Hence find the equation's general solution. 

Solution. It is easy to check that Y (x) = 1 jx is a solution as 

For this example equation (3.6) reads as 



^x ) dx \ \x^ J \x 

where w = dz/dx and z{x) /x = y{x) . Simplifying we have 

dw , „s 

- + (-2). = 0, 

and separating variables gives 

1— = 2 
w dx ' 

which we know from Example 16 to have general solution 

w = Ae^^. 

Integrating we have 

z = ae"^" + b 

(where a = A/2) and hence 

ae^^ + b 

y = zY = 

X 

is the original DE's general solution. ■ 

Example 29 Show that Legendre's equation (see Example 12) with m = 1 

has a solution of the form, Y {x) = ax + b and hence determine its general solution. 
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Solution. The function Y (x) = ax + b is a solution of the DE if 

(1 - x^) (0) - 2x (a) +2{ax + b)= 0. 

Equating coefficients we see that 

-2a + 2a = 0; 2b = 0. 

These show 6 = and that a can be any real; in particular F (a;) = a; is a solution. For this example, equation 
(3.6) reads as 

(1 - x') {x) ^ + ((2 - 2x2) (1) + {-2x) (x)) w = 0, 
where w = dz/ dx and z = y/x. The above simplifies to 



dw 



(x -x^)— + {2- Ax^) w; = 0. 
^ ' da; ^ ' 



Separating variables we find 



/^=/^d.= /f 

J W J X^ -X J \ 



-2 -1 -1 
X X — I x+l 



dx, 



givmg 
and 



In \w\ = —2 In \x\ — In la; — 1| — In la; + 1| + C, 



dz 



w = 



A 



1 1 
+ 



1 



da; a;2 (a; - 1) (a; + 1) \x'^ 2 (a; - 1) 2 (a; + 1) ^ 
Hence the general solution is 

At t — 1 

+ Bx. 



y (x) = a;^; (a;) = A + — In 



x+l 



3.3 The Linear Algebra behind Linear Differential Equations 



Recall that a homogeneous linear DE of order k is one of the form 

d*y d'^'^^w dy 

d^ + ^""^ d^ + ---+ai{x) — +ao{x)y = 0. 

As you will meet in other areas of mathematics, especially in the linear algebra courses, the space of solutions 

has some nice algebraic properties. 

Theorem 30 Let yi and 2/2 be solutions of a homogeneous linear differential equation and ai , a2 be real num- 
bers. Then aij/i + a22/2 is also a solution of the DE. Note also that the zero function is always a solution. This 
means that the space of solutions of the DE is a real vector space. 

Proof. We know that 

d^yi d^^^yi dyi 

o-k [x) + flfc-i (a;) + ■ ■ ■ + ai (a;) — + ao (a;) y2 = 0. (3.8) 

If we add ai times equation (3.7) to a2 times equation (3.8) and rearrange we find 

ttfe [x) ^ \ h ai (x) — h ao (a;) (aiyi + a2y2) = 0, 

which shows that aij/i + 022/2 is also a solution of the DE. ■ 
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Remark 31 The fact that all the above holds relies on nothing more than the rules 

d . , ^ . df dg d df 
d^(^ + ^^=d^ + d^' d^("^) = "d^ 

for functions /, g and real numbers a. These rules say that differentiation is a linear map. 

In the case when the DE is linear, but inhomogeneous, solving the inhomogeneous equation still strongly 
relates to the solution of the associated homogeneous equation. 

Theorem 32 Let Y (x) be a solution, known as a particular solution, or particular integral, of the inho- 
mogeneous linear DE 

d^y d^~^y dy 

dx^ ^ '^^^^ ^^'^ dx^ + ai{x) — + aQ{x)y ^ f {x) . (3.9) 

That is y = Y satisfies the above. Then a function y (x) is a solution of the inhomogeneous linear DE (3.9) if 
and only if y (x) can he written as 

y{x) = z (x) + Y (x) 
where z (x) is a solution of the corresponding homogeneous linear DE 

d^ z d^^^ z dz 
Ofe {x) ^ + ak-1 {x) ^^fcZT H h ai (a;) — + ao (a;) 2; = 0. (3.10) 

The solution z (x) to the corresponding homogeneous DE is known as the complementary function. 

Proof. If y{x) = z (x) + Y (x) is solution of (3.9) then 

, ,d''{Y + z) ,,d'=-i(y + 2) , ,d(Y + z) , ,,,, , , 

^""^ dxfe + dxk-^ + • • • + ai (x) ' + ao {x) {Y + z) = f (x) . 

Rearranging the brackets we get 

d''z \ f d'^Y \ 
a/c(x)^H \-ao{x)z\ + [^''{x)-^ + \-ao{x)Y\ = f {x) . 

Now the second bracket equals / (x) as F (x) is a particular solution of (3.9). Hence the first bracket must equal 
zero — that is z (x) is a solution of the corresponding homogeneous DE (3.10). ■ 

Remark 33 In practice, a particular solution is usually found by educated guess work and trial and error with 

functions that are roughly of the same type as / (x) . 

Remark 34 The space of solutions of (3.9) is not a vector space. They form what is known as an affine 
space. A homogeneous linear DE always has as a solution, whereas this is not the case for the inhomogeneous 
equation. Compare this with 3D geometry: a plane through the origin is a vector space and if vectors a and 
b span it then every point will have a position vector Aa + /xb; points on a plane parallel to it will have position 
vectors p + Aa + /ub where p is some point on the plane. The point p acts as a choice of origin in the plane, 
playing the same role as Y in the above. 

Example 35 Find the general solution of 

'''B+'^'-^^rx-'y='''- ^'-''^ 

Solution. We already showed in Example 28 that the general solution of the corresponding homogeneous 
equation 

is 

y[x) = . 
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So we just need to find a particular solution of (3.11). A reasonable first attempt would be to see if there is a 
solution of the form 

Y (x) ^ Ax + B, 

considering that the RHS equals 12x and that a function like Ax-\-B differentiates to a similar type of function. 
Such a F is a solution if 

xxQ + 2{l-x)A-2{Ax + B) = I2x. 

Rearranging this becomes 

{2A - 2B) - AAx = I2x. 

So we see that A = — 3 and B = — 3. That is F (x) = —3a; — 3 is a particular solution. So the general solution 
of (3.11) is 

ae^^ + b 
y (x) = 3a; — 3. 

X 

m 

We will meet further examples of inhomogeneous linear DEs in Section 4.2. 



3.4 Exercises 



Exercise 49 Use the method of integrating factors to solve the following equations with initial conditions 

dy 

- — \-xy = X where y (0) = 0, 
dx 

2x^^-3x\ = 1 where w(l)=0, 
dx 

dy 

wtanx = 1 where y(0) = l. 

dx 



Exercise 50 Solve the following differential equations: 

dy 
dx 



(l-x^)^ + 2x, = (l-x^f^: 



dw 

(cotx)y + cscx = 0. 

dx 

Exercise 51 By treating y as the independent variable, solve 

Exercise 52 Solve the following differential equations: 

dy 1 



dx cos y — X tan y 

dy _ 1 
dx ev — x' 

dy 3y 



dx 3j/2/3 — X 

Exercise 53 * Show that multiplying the equation 



xy^ + (2x2 + y + x)=0 
one of the fori 
{F{x,y))=0 



by the integrating factor x turns the equation into one of the form 

d_ 
dx 

where F (x, y) is a polynomial in x and y. 
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Exercise 54 Show that u{x) — l/x^ is a solution of 



^dJ-(^ + ^^£ + 2^ = °- 
Hence find the equation's general solution. 

Exercise 55 Find a particular solution of 

-^0-(- + 2)| + (x + 2), = O 

and hence find the general solution. 

Exercise 56 Find a particular solution to the differential equation 

d^y ^ „d2/ 

d^+'d^+'^ = ^^")' 
for each of the following different choices of f {x) : 

f{x) = x^; f{x) = e''; f{x)=smx. 

Exercise 57 Find a particular solution to the differential equation 

d^y d^y d^y « , • 

dx'^ dx-' dx^ 

Exercise 58 Find a particular solution to the differential equation 
when (i) f (x) =sin^a;, (ii) f (x) =sina;, (Hi) f (x) =a;sin2a;. 

Exercise 59 * ^ mass P swinging on the end of a light rod is governed by the differential equation 

d^e g 

dt^=-r^"^- 

The pendulum starts from rest at 6 = n. 

(a) Let u) = dO/dt. Show that d^^O/dt^ = u)doj/d9 and deduce that 

= y (cos 9 — COS a) . 

(b ) Hence show that the time T of an oscillation equals 

, . . de 

T = 44 



2^ Jo V cos 9 — cos a 

Show that if only small oscillations are performed so that the approximation cos 9 « 1 — ^^/2 applies for 

^ 9 ^ a, then 

Exercise 60 * Bernoulli's equation has the form 

^+P{x)y = Q{x)y^ 

Show that Bernoulli 's equation can be reduced to an inhomogeneous linear first order equation by means of the 
substitution z = y^"". 
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Exercise 61 * Solve the following differential equations 



da;^ ^ ' dx X ^' ^da; *' 

Exercise 62 * Clairaut's equation has the form 

^ ^ dx ^ (da; 

Show that, by differentiating with respect to x, the equation can be reduced to a first order differential equation 
in dy/dx. 

Hence solve the differential equation 



^ ^ da; V da; 
Exercise 63 * Find a parametric solution of 

as follows. 

(a) Write an equation for y in terms ofp = dy/dx and x, and show that 

p = p^ + {2px + l)^. 

(b) Using p as the independent variable, arrange the above as a linear first- order equation for x. 
(c) Using an appropriate integrating factor show that 

In I? — p + c 

where c is a constant. Together with the expression for y in terms of p from part (a) we have a parametric 
solution {x (p) ,y{p)). 

Exercise 64 * An engineer builds himself a unicycle with a square wheel of side D, and wonders now what 
type of road surface he should make in order that he can ride his unicycle while remaining at a constant height. 
A mathematician friend of his says a road surface with the equation 

y = -Dcosh^ (x/D) 

would work and gives her reasoning as follows: 

Call the road surface {x (.s) , y (s)), parametrised by arc-length and without loss of generality start the middle 
of the base of the wheel at (0, 0) with the base parallel to the x-axis. 

(a) Let's now say the square wheel has rolled distance s. Show that the centre of the wheel is at 

{x (s) ,y{s))-s {x' (s) , y' (s)) + ^ {-y' (s) , x' (s)) . 



(b) Hence show that a rider's height will remain constant if 

y (s) - sy' (s) + y-^l-y' {sf = y . 



(c) By differentiating this equation with respect to s, or otherwise, find y (s) and x (s) , and hence show that 
the mathematician's answer is correct. 
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4. LINEAR ODES WITH CONSTANT 
COEFFICIENTS 



In this chapter we will look to treat the theory of solving linear differential equations 

d^y d^^^y dy 

^'d^+"''-'d^+---+'''di+^'y^^^''^ 

where the functions ao,ai, . . . ,ak are constants. 

We have already seen in Theorem 32 that the difference between solving the inhomogeneous and homogeneous 
is in finding a particular solution, so for now we will concentrate on the homogeneous case. 

Mainly we shall treat examples when the equations are second order, though the theory extends naturally to 
similar higher order equations. We begin with the example of simple harmonic motion (SHM). This is the 
equation describing the vibrating of a spring or the swinging of a pendulum through small oscillations. The DE 
governing such motions is 

where w is a positive constant. 

Example 36 Show that the general solution of (i-l) is of the form 

y{x) = A cos ujx + B sin wx. 

The constant lj is the angular frequency of these oscillations, with the solutions having period 2-k/w. 

Solution. We firstly set v — dy/dx. By the chain rule 

d?y dv dy dv dv 
dx'^ dx dx dy dy 



and the differential equation (4.1) becomes 



dv 2 
V— = -uj y 
dy 



which is separable. If we separate the variables and integrate we find 

-V =--uy +K. 



Recalling v = dy/dx we have 



dy _ /r^ . 0.2 



dx 

Again this is separable and we may solve this to find 



-I 



dy 



^/K-cj^y^' 

We met such integrals in the very first section on Trigonometric Substitutions, a sensible one here being 



VK . ^ 
y = sm t 

CO 



LINEAR 

COEFFICIENTS 



ODES 



WITH 



CONSTANT 
35 



dt = - + L, 



which simpUfies the integral to 

/^V^/wj cos 
v^cost w 
for some constant L. RecalHng y — (^^/K/lo^ sint we have 

^ ■ f 

y = smoj (x — L) 

w 

or alternatively 

y = vlcoswa; + -Bsinwa; 
for constants A and B, by using the sin (a + /?) formula. ■ 



4.1 The Homogeneous Case 



The SHM equation is a special case of the more general DE we are interested in, which is treated by the following 
theorem. 



Theorem 37 Consider the DE 

where Q and R are real numbers. This has auxiliary equation (AE) 

+ Qm + i? = 0. 

Its general solution is: 

1. in the case when the AE has two distinct real solutions a and p: 

y{x) = Ae""" +Be^''; 

2. in the case when the AE has a repeated real solution a: 

y{x) = (Ax + B) e«"; 

3. in the case when the AE has complex conjugate roots a + ip and a — i/S: 

y (x) = e"^ (A cos px + B sin ^x) . 

Note that the following proof is not typically examined and only a knowledge of the above solutions is expected. 



Remark 38 Note, in the previous SHM example, we tackled the case when the auxiliary equation's roots are 
ibwz. I have aimed in the following to present proofs that will suit both those who are au fait with complex 
numbers and those who arc; not. Those with no or limited knowledge of complex numbers need only recognise 
that quadratic equations which don't have real roots may be written in the form {x — a)^ + 0^ = Q for some 
real a and For those who have met Euler's result that 



e = cos 6* + isin6', 
the form of the solution in case three will be less surprising. 
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Proof. Cases 1 and 2: Let's call the roots of the AE a. and /3, and assume for the moment that they are real 
roots, but not necessarily distinct. We can rewrite the original DE (4.2) as 

Firstly note that Y {x) = e^^ is a solution as substituting this in gives 

^^e^^ - (a + /3) /3e'^^ + a^e^^ = 0. 

We saw in Section 3.2 how knowledge of a solution can simplify a second order homogeneous linear DE — if we 
set z{x) = y {x) e~^^ then from equation (3.6) we have 



e^^^ + (2/3e'^" + {-a - /3) 6^"=) w = 



.dw 
dx 

where w = dz/dx. Rearranging somewhat we find 

dw , „s , ^ 

— = ia-l3)w. (4.3) 

We now have two cases to consider: when a — P and when a ^ In the case when the roots are equal then 
(4.3) leads to the following line of argument 

w{x) = dz/dx = A (a constant), 
z (x) = Ax + B {A and B constants) , 
y{x) = z (x) e^"" = {Ax + B) e^^ , 

as we stated in Case 2 of the theorem. 

In the case when the roots are distinct reals then (4.3) has solution from Example 16 

u,(ar) = ^=cie(«-«- 
da; 



(where ci is a constant) and so integrating gives 



^(x) = ^e(«-'^^^+C2 
a — p 

(where C2 is a second constant) to finally find 

y{x) = z (x) e^^ = ^^e"^ + cae^^. 

Case 3: Suppose now that the roots of the equation are conjugate complex numbers a ± For those who 
are aware of Euler's relation 

e'^ = cos ^ + z sin 6 

then we can simply treat this case the same as Case 1. Allowing for A and B to be complex numbers then (4.2) 
has a general solution of the form 

y {x) = Ae("+*'^)^ + Be^"-'^'^"" = e" {A' cos px + B' sin (5x) , 

for constants A' and B' . 

Alternatively here is a proof which does not rely on complex numbers or Euler's relation. The original DE 
(4.2) when the AE has roots a ± i/3 is 

g-2„^ + („"+^^), = 0. (4.4) 

We will first make the substitution 

z{x) = y{x)e-''\ 
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Though it is not the case that e"^ is a solution this substitution will transform the DE into something familiar. 
Note 

da; da; da; dx^ da;^ da; 

Hence (4.4) has become a new DE involving z (x) 



, ,e"^ +2a— e"^ +a^2e"^ -2a — e"^ + aze"^ + + ze"^ = 0, 
dx^ dx / Vdx 



which simplifies to 



dx^ 

Dividing through by e"^ gives 

which we recognise as the DE for SHM. This has general solution z = A cos 0x+B sin l3x and so we can conclude 

y (x) = {A cos Px + B sin /3x) 

as required. ■ 

Example 39 Find the general solution of the differential equation 

— -6— +9 =0 

dx"^ dx 

Solution. This has auxiliary equation 

= - 6m + 9 = (m - 3)^ 
which has a repeated root of 3. Hence the general solution is 

y={Ax + B) e^^ 

■ 

Example 40 Solve the equation 

— -3— +2 =0 
dx"^ dx ' 

with initial conditions 

y(0) = l, y'(0)=0. 

Solution. This has auxiliary equation 

= TO^ - 3rn + 2 = (m - 1) (m - 2) 
which has roots m = 1 and m = 2. So the general solution of the equation is 

y (x) = Ae^ + Se^^. 

Now the initial conditions imply 

1 = y{G)=A + B, 
= y' {Q)=A + 2B. 

Hence 

A = 2 and B = -1. 
So the unique solution of this DE with initial solutions is 

y(x) = 2e"-e2^ 
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Example 41 Find all solutions of the differential equation 

Sy 2*^^+2 
dx"^ dx 

which satisfy the boundary conditions 

y (0) = and y (tt) = 0. 

Solution. This has auxUiary equation 

= - 2m + 2 = (m - 1)^ + 1 
which has roots m = l±i. So the general solution of the equation is 

y (x) = (A cos a; + -Bsina;) . 

Now the boundary conditions imply 

= 2/(0) = A 
= y (tt) = -e^'A. 

Hence ^ = and there are no constraints on B. So the boundary- value problem has solutions of the form 

y {x) = Be^ sin a: 

for any B. m 

The theory behind the solving of homogeneous linear DEs with constant coefficients extends to all orders, 
not just to second order DEs, provided suitable adjustments are made. 

Example 42 Write down the general solution of the following DE 

d^y (fy (fy ^ cfy , d^y .dy , 

— - H - 5 — - + 4 — - + 4— - 4y = 

dx^ dx^ dx^ dx'^ dx"^ dx 

Solution. This has auxiliary equation 

m'^ + - - 5m* + 4m^ + 4m - 4 = 0. 
With can see (with a little effort) that this factorises as 

(m- 1)^ (m^ + 2m + 2)^ = 

which has roots 1 , — 1 + i and —1 — i, the first being a triple root and the latter two double roots. So the general 
solution of the DE is 

y (x) = {Ax'^ +Bx + C)e=' + {Dx + E) e"^ cos x+{Fx + G) e"^ sin x. 



4.2 The Inhomogeneous Case 



In the previous section we discussed homogeneous linear differential equations with constant coefficients — that 
is equations of the form 

d'^y , A'^-^y , , dy 



^feTT , u 1 + +oo2/ = 0. 

da;'' da;''"^ da; 
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These equations occur naturally, for example the simple harmonic motion equation 

governing the oscillations of a spring freely vibrating. (Such an equation arises from Hooke's Law). However if 
the oscillations are being driven at another frequency Q. the equation could now look like 



+ LO^y = Asm fit, 



d 2/ , , ,2 



di2 

which is an inhomogeneous linear DE with constant coefficients. As has already been noted in Theorem 32: 

• The solutions y (x) of an inhomogeneous linear differential equation are of the form z {x)+Y (x) 

where z (x) is a complementary function, i.e. a solution of the corresponding homogeneous 
equation, and Y (x) is a particular solution of the inhomogeneous equation. 

The particular solution Y (x) is usually found by a mixture of educated guesswork and trial and error. 

Example 43 Find the general solution of 

Solution. As the function on the right is f (x) = x then it would seem sensible to try a function of the form 

Y (x) =Ax + B, 

where A and B are, as yet, undetermined constants. There is no presumption that such a solution exists, but 
this seems a sensible range of functions where we may well find a particular solution. Note that 

d^ . J d^y ^ 
— — = A and — — ^ — 0. 
da; da;^ 

So if Y (x) is a solution of (4.5) then substituting it in gives 

0-3A + 2{Ax + B)=x 

and this is an identity which must hold for all values of x. So comparing the coefficients of x on both sides, and 
the constant coefficients, 

2^ = 1 giving -4 = ^, 
3 

-3A + 2B = giving B = -. 

What this means is that 

^(^)=2+4 

is a particular solution of (4.5). Having already found the complementary function, that is the general solution 
of the corresponding homogeneous DE in Example 40 to be Ae'' + Be^^, then by Theorem 32 we know the 
general solution of (4.5) is 

y{x) = Ae'=+Be^^ + ^ + ^, 

for constants A and B. m 

Example 44 Solve the initial value problem 
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Solution. Prom Example 39 we know that the general solution of the corresponding homogeneous equation is 

y={Ax + B) e"". 

This means that trying neither y = e^^ nor y = xe^^ as a particular solution is worthwhile as substituting either 
of them into the LHS would both yield 0. Instead we will try a particular solution of the form Y {x) — Ax^e^'^ . 
For this Y we find 

—5- - 6— + 9F = I (2 + 12a; + Qx^) - 6 (2x + Sx^) + Qx^j Ae^^ = 2Ae^'' . 
dx^ da; ' ^ ' ' 

Hence a particular solution is Y {x) = |a;^e^^. The general solution of the given inhomogeneous DE is 

' " * 3a; 



y{x)=y— + Ax + B]e: 

Now y (0) = B = 1; as 

y' {x)^(x + A+^-+ 2,Ax + iB ) e^^ 



2 

then y' (0) = A + iB = 1 and so A = —2. Hence the initial value problem has solution 



a;2 



(y-2x + l)e3^ 



Example 45 Find particular solutions of the following DE 



d.^ ^dl+2^ = ^(^) 



where 



• / (a;) = sin x — Simply trying Y (x) = A sin x would do no good as Y ' (x) would contain cos x terms 
whilst Y (x) and Y " (x) would contain only sin x terms. Instead we need to try the more general Y (x) = 
Asmx + B cos x. 

• / (a;) = e^'^ — This causes few problems and, as we would expect, we can find a solution of the form 

• / (x) = — This is different to the previous case because we know Ae^ is part of the general solution to 
the corresponding homogeneous DE, and simply substituting iny = Ae^ into the LHS will yield 0. Instead 
we can successfully try a solution of the form Y (x) = Axe^. 

• / (x) = xe^^ — Again Ae^^ is part of the solution to the homogeneous DE. Also as with the previous 

function we can see that Axe?^ would only help us with a e^^ term on the RHS. So we need to move up a 

further power and, try a solution of the form Y (x) = ( Ax^ + Bx) e^^ . 

• / (x) = sin x — Though this may look somewhat more complicated a particular solution of the form 

Y (x) = (^sinx + Bcosx) 

can be found. 

• / (x) = sin^ X — Making use of the identity s\v? x = (1 — cos 2x)/2 we can see that a solution of the form 

y (x) = A + B sin 2x + C cos 2x 

will work. 
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4.3 Exercises 



Exercise 65 Determine exactly the particular solutions from amongst the suggested families of function in 
Example 45- 

Exercise 66 Find the most general solution of the following homogeneous constant coefficient differential equa- 
tions: 

T^~y = ^'^ TT+42/ = 0; -7-^+ 3— +2y = 0; — ^ + 2— + 3y = 0. 
da;"' ax'' ax'' ax dx^ da; 

Exercise 67 Find the most general solution of the following higher order homogeneous constant coefficient 
differential equations: 

dx^ ' da;^ ' da;'^ dx^ dx ' dx'^ dx^ 

Exercise 68 Solve the following initial-value problems 

(i) 0+y = e^cosa: y (0) = 1, y' (0) = 0; 

d^v 

(ii) d^ + = ''"^^ ^ ^ = °' y' = ^■ 

Exercise 69 Solve the initial-value problem 



Exercise 70 S'otoe the initial-value problem 

- y = y^^^ (0) = 1 /or =^ fc < 2007. 

Exercise 71 By means of the substitution z ^Inx find the general solution of the differential equation 

^d^y dy 
X +x— + y = 0. 
da;^ da; 

Exercise 72 Find the general solution of 

Exercise 73 Find all the solutions (if any) of the following boundary-value problems: 

S^'''^' 2/(o) = i,y(i) = -i; 
(n) = -'^'2/' y(0) = i,y(i) = -i; 

(in) ^ = y (0) = l,y (1) = 0. 

Exercise 74 Show that, for any values of a and 0, the function 

y (x) = sinx f (t) cos tdt — cos x f (t) sin t dt 

Ja Jp 

is a solution to the differential equation 



Hence solve the boundary-value problem 

d^y 



^^2+y = cscx, y (0) = y (7r/2) = 0. 
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Exercise 75 Show that the solutions of the differential equation 

-r-^ +0J X = cosilt 
at-' 

are bounded when Cl ^ w, but become unbounded when Cl = uj. 

Exercise 76 Find the most general solution of the following inhomogeneous constant coefficient differential 
equations: 



d^y .rdy.^ 

ax^ ax 



Exercise 77 Find the general solutions of the differential equation 

when (i) a,b,c are distinct real numbers, (ii) a and b are distinct real numbers and c = a. 
Exercise 78 Write down a family of trial functions y (x) which will contain a particular solution of 

for each of the following different choices of f (x) : 

(i) f [x) = x"^, (ii) f (x) — xe^, (Hi) f (x) = xe~'^^, (iv) f (x) = x^ sinx, (v) f {x) = sin^ x. 



Exercise 79 By making the substitutions x = e* and y = v{t)e* , or otherwise, solve the differential equation 



Exercise 80 Chebyshev's equation is 

where n is a non-negative integer. By making the substitution x = cos 9, show that Chebyshev's equation has 
two independent solutions 



Tn {x) = cos (n cos ^ x) , Vn{x) = sin (n cos ^ x) . 



Show that 



Tn (x) + iVn {x) = (x + i^/l^Y 
and deduce that Tn (x) is a polynomial of degree n. 

Exercise 81 * Let ao,ai, . . . ,ak be real constants and suppose that the polynomial 



z'^ + ak-iz'^ ^ -\ Vaiz + ao=0 



has k distinct real roots 


7i,72> 


• * • 5 


7fe- 






(a) Show that 


















/ 


1 


1 


1 


1 \ 








7i 


72 


73 ■ 


Ik 




det 




ll 


7i 


ll ■ 


■ ll 






V 






IV ■ 


■ it') 



= n (7,-7.). 



(b) Deduce, that for any cq, ci, C2, . . . , ci~-i, the initial-value problem 

dy 



d^ d'^-^ 
dx'^ 

has a unique solution. 



dx^ ' "^-i^^ + '" + ''i^ + ''o2/ = °' y(0) = co,y'(0) = ci,...,y('=-i)(0) = Cfe_i 
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Exercise 82 * Assuming that term-by-term differentiation is valid within a power series' range of convergence, 
show that the power series 

oo 

y{x) = ^akx'' (4.6) 

fe=0 

is a solution of the initial-value problem 



provided 



0+j/ = O, y(0)=y'(0) = l, 



^ik + 2)Tfc + 1) ' '^"^ «o = ai = l. 



Hence determine the coefficients ah. Solve the above initial-value problem and show that the solution agrees with 
the power series (4-6) above. 

Exercise 83 * Use the power series method of Exercise 82 to find the general solution of the differential equation 

dx2 (1 _ 

as a power series. For what range of x -values does your series converge? 
Exercise 84 * Bessel's equation reads as 

X --^ -\-x— -\- (X -v)y = 
dx'' ax ^ ' 

where v ^ 0. 

(a) Show that if v is an integer then 

n=0 ^ ' 

is a solution of Bessel's equation. 

(b) By making the substitution y = or otherwise, find the general solution to Bessel's equation when 
V = 1/2. 

Exercise 85 * The differential equation 

is known as Legendre 's equation. Find a recurrence relation amongst the coefficients of a power series solution 

Show that, if n is a non-negative integer, there is a polynomial solution Pn{x) of Legendre 's equation of 
degree n. 

This polynomial is unique up to scalar m,ultiplica,tion and so it is conventional to further assume that Pn (1) = 
1. These polynomials are known as Legendre polynomials. Find Pn (x) for ^ n ^ 4. 

Exercise 86 * Show that Legendre 's equation can be rewritten as 



_d_ 
dx 



n{n+l)y = 0. 



Deduce, that for n > m > 0, 

»i 

Pn {X) Pm {X) dx = 0. 

-1 
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5. SYSTEMS OF LINEAR DIFFERENTIAL 
EQUATIONS 



5.1 Some Facts relating to Matrices 



We introduce here some basic definitions and properties relating to matrices. We will use matrices later to work 
with simultaneous differential equations like 

dx dy 

_=2x + 3y, ^=3x + 4y, 

though the theory extends more generally to n linear differential equations with constant coefficients in n 

variables. 

Definition 46 A (real) mx n matrix is an array of real numbers arranged into m rows and n columns. This 
array is typically placed inside brackets and we write 



I an 



A = 



ai2 ai3 



\ CLml O-ml O-mS 



din \ 



1 



-3 TT 

2.5 



The notation aij denotes the entry in the ith row and jth column. 
Example 47 Let 

A = (aij) 

This is a2 X 3 matrix — ai2 = —3 and 021 = e^. 

In the main we shall only be interested in 2 x 2 matrices. 
Definition 48 Let 



A = 



an 

0.21 



ai2 

^22 



and B = 



bn 
b2i 



bi2 

b22 



be two 2x2 matrices. These matrices can be added to form the sum A + B and multiplied to form the product 
AB as follows. 



A + B = 
AB = 



aii+bii ai2- 
021 + &21 022 - 

aii6ii +012621 
0121611 + 022621 



612 
622 

aii6i2 + 012622 
021612 + 022622 



Note, for example, that the 1st row, 2nd column entry in AB is calculated by taking the term by term products 
of the 1st row of A and 2nd column of B and adding these products (note that this is just the same as taking 
the dot product of the first row and second column) — that is 



On 



012 



O21 O22 



611 
621 



612 



^22 



011611 + 012621 O11612 + O12622 



O21611 + 022621 O21612 + 022622 



If c is a real number, often referred to as a scalar in this context, then we can also form the scalar multiple 
cA by multiplying each entry of A by c, i.e. 



cA 



can 
CO21 



cai2 
CO22 
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Remark 49 More generally it is possible to 



• add an mi x m matrix to an 7712 x 712 matrix if rui = 7712 and 7ii = 712 — the (i, j)tli entry in the sum is 
the sum of the matrices' {i,j)th entries; 

• multiply an rni x m matrix by an TO2 x ^2 matrix if m = m2 to produce an mi x 712 matrix product — 
the (i,j)th entry in the product is calculated by taking the term by term products of the ith row of the 
first matrix and the jth column of the second and adding them, i.e. we take the dot product of the ith 
row and jth column; 

• multiply any m x n matrix by any real number c to form a scalar multiple — the (i,j)th entry in the 
scalar multiple is c times the {i, j)th entry in the matrix. 



Example 50 Let 

Fi7id A + B,B + A,AB aTid BA. 
Solution. Then 

A + B = 



A={ I J and B=( ^ 



2 + l + 3\/2 4 
-1-2 3 + 5]~l-3 8 



B + A = 
AB = 
BA = 



+ 2 3 + l\/2 ' 
-2-1 5 + 3 -3 I 

2 iWO 3\_/0-2 6 + 5 \ / -2 11 

-1 3 ) \ -2 5 ) ~ yO-6 -3 + W ) ^ \ 6 12 

3\/2 l\/0-3 0+9 \/-3 9 
-2 5 j I -1 3 j ~ I -4-5 -2 + 15 j ~ i -9 13 



3A = ( I M; 3E V 9 



3 9 y ' V -6 15 

3A + 3B = { ll)=3{A + B) 



Example 51 • Note that A + B = B + A and this is generally the case; however AB 7^ BA in 
this case and we see that matrix multiplication is not generally commutative. 

• On the other hand matrix multiplication is always associative. That is 

(AB) C = A (BC) 
for any 2x2 matrices A, B and C. (We shall 7iot prove this here.) 

• The distributive law 

aA + aB = a{A + B) 
holds for all matrices A, B and reals a, as does the other distributive law 

{a + b)A = aA + bA 

for all matrices A and reals a, b. 
Definition 52 The determinant 0/ a 2 x 2 matrix A written AeiA or \ A\ is 

, . , / Gil ^12 \ 

det A = det = aiia22 - ai2a2i- 

\ a21 0-22 
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Remark 53 Determinants can more generally defined for n x n matrices. In the 3x3 case 

(an ai2 ai3 \ 
a21 022 a23 = 0110122033 + 012023031 + O13O21O32 - Oiia23032 - O12O21O33 - 013022031. (5.1) 
031 032 033 / 

Example 54 With A and B as in the previous example we have 



detA = det|^ __^ g j = 2 x 3 - 1 x (-1) = 7; 
detB = det( ^ M = x 5 - 3 x (-2) = 6; 



2 




-1 


3 





3 


-2 




2 


4 ■ 


-3 


8 


-2 


11 


6 


12 



det(.4 + S) = det _g ^ j = 16 + 12 = 28 7^ detA + detS; 
det (AB) = det ( ~^ |M = -24 + 66 = 42 = det A det B. 



Proposition 55 Let A and B he two 2x2 matrices. Then 

det {AB) = det A X det B. 



Proof. 



Then 



1 . / ^ ^>^ ^ , f ae + bq af + bh \ 

= (ae + bg) {cf + dh) - (a/ + bh) (ce + dg) 

= bgcf + aedh — bhce — afdg 

= {ad - be) {eh - fg) 

= det A det B. 



Definition 56 The 2x2 identity matrix is denoted by I (or I2 if we wish to stress the 2 x 2 context) and 
equals 

vol 

It has the property that I A = A = AI for any 2x2 matrix A. 

Definition 57 The 2x2 zero matrix is denoted by (or O2 if we wish to stress the 2x2 context) and equals 

V 

It has the properties that OA = = AO and A + = A = + A for any 2x2 matrix A. 

Definition 58 Given a 2 x 2 matrix A then an inverse matrix for A (which may or may not exist) is a2x 2 
matrix B such that 

AB = I = BA. 

If A has an inverse then it is unique (see below) and we denote it as A~^. 
Note if A~^ exists then 

1 = det / = det (AA-^) = det A det {A'^) 

means 

detA'^= ^ 



detA' 

If A has an inverse then it is said to be invertible, and is said to be singular if it has no inverse. 
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Proposition 59 ^2x2 matrix A has an inverse if and only if det A ^ 0. If det A then a unique inverse 
exists and equals 



1 



d 



ad - be \ —c a 

where A = ( ^, 
\ c d 

Proof. If A has an inverse B then 

1 = det/ = dct^B = det Adet5 
and so we see that det Aj^O. On the other hand if det A = ad — bc^O then we see 



1 



ad - be \ -c a 



a b 
a d 



= I and 



a, b 
c d 



1 



ad - be \ -c a 



I. 



So an inverse exists. 

Let's suppose two inverses B and C for A existed. That is there were two matrices B and C such that 

BA = I = AB and CA = I = AC. 
As matrix multiphcation is associative then 

C = IC= (BA) C = B {AC) = BI = B. 
Hence the inverse we have found is indeed unique. ■ 

Remark 60 Two vectors v, w are said to be linearly dependent or just dependent if one is a scalar multiple 
of the another — i.e. if they are parallel — and are said to be linearly independent or just independent 
otherwise. 

Note that ad = be if and only if (^) and (^) are parallel. So a 2 x 2 matrix is invertible if its columns (or 
equivalently rows) are independent vectors. 



Definition 61 The eigenvalues of a2 x 2 matrix A 



det {A - xl) = det 



a b 



^ 1 are the roots of the quadratic equation 



a — X b 
c d — X 



= [a — x){d — x) — be 

= x"^ — {a + d)x + {ad — be) = 0. 



Example 62 Find the eigenvalues of the following matrices 



Solution. 



2 1 
6 3 



det {A - xl) 
det {B - xl) 
det (C - xl) 



B 



2 1 
2 



C 



-1 

1 



2-x 1 
6 3 — a; 

2-x 1 
2-x 

-X -1 
1 -X 



x'^ — 5x = X {x — 5) : 
{x-2)'; 



x^ + 1. 



So we see the eigenvalues are 0, 5 for A, 2, 2 for B, and ±i for C. 
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Definition 63 Given a 2x2 matrix A and a 2 x 1 column vector v = (^) (we can think of as a 2x1 matrix) 
then we can form the matrix product Av which is another 2x1 column vector. Recall 



Aw- 



a b \ f X \ _ f ax + hy 
c d)\y)~\cx + dy 



In this way A defines a map known as a linear map. ( cf. Michaelmas Linear Algebra course.) This 

means that 

A (civi + C2V2) = ci^vi + C2AV2 
for vectors vi and V2 and scalars ci, C2. In particular, AO = 0. 

Example 64 Determine the following products. 

2 ^ (2 

6 3 J \ J \ 6 

6 3)(!) = (3 
2 1 \ f 2 \ f \ 2 



6 3 J \ -4 J \ J \ -4: 

• Note that the matrix maps (q) to its first column and (J) to its second column; it is easy to check that 
this generally happens. 

• Note, in the third line, that it is possible for a matrix to map a non-zero vector to the zero vector. 

• Note that in the third and fourth lines the vectors are mapped to scalar multiples of themselves, and that 
these scalar are ^'s eigenvalues from Example 62. 

Proposition 65 The vector equation Ax = has a non-zero solution x = v if and only if det A = 0. 

Proof. If det A^ then we know from Proposition 59 that A has an inverse A~^. So if Av = then 

v = A-^Av = A-^0 = 0, 
and we see is the only solution. On the other hand if det A = ad — bc = then 



a b \ I d \ I ad — be 
c d I [ -c j ^ [ 



0. 



So (^^) is a non-zero solution unless c = = in which case ( J'^ will do, unless a = b = c = d = Oia which 
case any non-zero vector will do! ■ 

Corollary 66 If x = X is a real eigenvalue of A, so that det {A — XI) = 0, then there is a non-zero vector v 
such that {A — XI) v = 0. That is 

A-v = Av. 

Definition 67 A non-zero vector v such that Av = Av for some scalar X is called an eigenvector or a 
X-eigenvector. X is the corresponding eigenvalue. 

• Note conversely if Av = fiv for some non-zero vector v then [A — jil) x = has a non-zero solution x = v 

and so by the previous proposition det {A — jil) = 0; in particular ^ is an eigenvalue of A. 

• Note that if v is an eigenvector then so is cv for any non-zero scalar c. 
ExEimple 68 Find all the eigenvectors of the matrices A, B, C from Example 62. 
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Solution. A has eigenvalues and 5. We need to solve the vector equations 



6 3 J \ y J ' \ 6 -2 J \ y 
The first vector equation says that 

2x + y = and 6x + 3y = 0. 

Note that the equations are really the same equation (being scalar multiples of one another) which is why they 
have non-zero solutions. We can set x = ci and then y = — 2ci Working similarly with the second case we see 
that A's eigenvectors are of the form 

ci ( ^2 ) ( 3 



B has repeated eigenvalue 2. We need to solve the vector equation 
which has solutions 



1 


Note that B does not have two independent eigenvectors. 
C has no real eigenvalues and so no real eigenvectors. ■ 

Definition 69 j4 2 x 2 matrix A = [ ^] is said to be diagonal if b = c = 0; that is it is of the form 



a 
d 



Theorem 70 Suppose that the 2x2 matrix A has distinct real eigenvalues Ai and A2. Let Vj for i = 1,2 be 

corresponding Xi-eigenvectors and let 

(V1|V2), 

that is P has columns vi and V2 . Then P is invertible and 

P-^AP = 



Ai 
A2 

y j =(vi|v2)(^ ^ ) =xv^+yv2 = 0. (5.2) 

A {x^fl + yv2) = xXiVi + yA2V2 = 0. (5.3) 

(A2 - Ai) yv2 = 0. 

As Ai ^ A2 and V2 ^ 0, then y = and hence x = 0. As (5.2) only has as a solution then P is invertible by 
Proposition 65. 

Finally 

AP = A (vi I V2) = (Ai VI I A2V2) = p( ° 



Proof. Suppose that 
Then 

Now (5.3) minus Ai times (5.2) gives 



A2 



and premultiplying by P ^ on both sides gives the required result. 



Example 71 Let A be the matrix given in Example 62. Determine whether there is a real matrix P, such that 
P~^AP is diagonal. Are there similar matrices P for B and/or C? 
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Solution. In Example 62 we showed that A had eigenvalues and 5 with corresponding eigenvectors (_^2) 
and (3) (see Example 68). From the previous theorem we know then there is such a P which has these two 
eigenvectors as its columns. We can check this directly works here: 

f 1 iyY2l\/l ^\^lf^-'^\{0 5\l/0 0\/0 0\ 
\ -2 3 J \6 3j\-2 3j~5\2 1 J \ 15 J 5\0 25j~\0 5j 

as required. A is said to be diagonalisable. 

B on the other hand has eigenvalues 2 and 2. If such a P existed then it would follow that 

(Why would this be the case?) But then we'd have 

B = P {21) P-^ = 2PP-^ = 21 
and this is not the case; for B no such P exists and B is NOT diagonalisable. 
Similarly for C, if such a real matrix P existed we'd have 

P-CP=(' °) 

and this is clearly not possible with both P and C being real matrices. ■ 



5.2 Simultaneous Differential Equations 



As promised, we shall now apply matrices to coupled differential equations, governing two functions x and y of 
a variable t, when the equations have the form 

dx dy 

— = ax + by, ~ ^^' ^' ^ ^ ^) • 

We already have two ways of approaching such systems, and we will meet a third way relating to the matrix 
theory we have recently met. We shall work with a specific example with initial values 

dx dv 

— =3x + y, -^ = 6x + 4y, a; (0) = y (0) = 1. 

METHOD 1: If we use the first equation to write y = dx/dt — 3x, and substitute this into the second equation 
we have 

d X „dx „ .da; 



and hence 



dt^ 'Tt='^ + 'Tt-''^' 



d^x „dx 

W-'dt+'^ = '- 



with x{0) = 1 and x' (0) = 3a; (0) + y (0) = 4. We met in Chapter 4 how to solve such linear DEs with constant 
coefficients. 

METHOD 2: Alternatively we could note 

dy dy / dx 3x + y 



dx dt I dt 6x + 4y 
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and this is a homogeneous polar equation which we met previously in Section 2.4 and can be solved with a 
substitution of the form y (x) = xv {x) to give an equation relating x and y. By substituting our expression for 
y back into the original equations in principle it should be possible to determine x and y in terms of t. 

METHOD 3: Our third method relates to the linear algebra we met in the previous section. We can rewrite 
our two equations as a single differential equation in a vector v = (^) , namely 



dv 

dt 



dx/dt 
dy/dt 



'ix + y 
6x + Ay 



3 1 
6 4 



V. 



Let's write A 



3 1 
6 4 



. If we find A's eigenvalues, these turn out to be 1 and 6 and two corresponding 



independent eigenvectors are ( j^j) (3) respectively. If we put these eigenvectors into the columns of a matrix 



1 1 
-2 3 



so that P-'^AP - 



1 
6 



then we have 



dv 

di 



= Av = P 



1 
6 



Hence, because the entries of P -"^ are constant, 



d P- 



1 
6 



(P-V) . 



If we set (^) = P ^v, so that X and Y are new functions in t, then we have 

dY 



with 



Thus 



X(0) 

y(o) 



= p- 



dX _ 
~dt ~ 

x(0) 

y(o) 



X and 



dt 

3 
2 



QY 



2/5 
3/5 



X (t) = -e* and Y {t) = -e' 



6t 



and finally 



vit) '~ 



X{t) 

Y{t) 



1 1 

-2 3 



5^ 

3 6* 
5^^ 



More generally for such systems of differential equations we have: 
Theorem 72 Two variables x (t) and y (t) satisfy the differential equations 



dx 



= ax + by, 



dy 
dt 



= cx + dy. 



(5.4) 



If the matrix A 



a b 



^ j has distinct real eigenvalues Ai and A2 with corresponding eigenvectors vi and V2 



then the general solution of (5.4) is 



x{t) 

y{t) 



A2t, 



(5.5) 



Proof. The proof is simply a duplication of the working shown in Method 3 to the general case and so is 
omitted. ■ 



Whilst we won't go through the general details of what happens if the eigenvalues are repeated or complex 
we will treat two such examples below. Though it is worth noting 
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• Formula (5.5) still holds when the eigenvalues Ai and A2 are complex provided we cdlow the 
eigenvectors Vi,V2 to be complex and also the constants A and B. The solution Cctn then be 
written as an overtly real function using the identity 



cos^ + isin^. 



Example 73 Consider the linear system 



— =2x + y, -^ = -Ax + 6y, a; (0) = y (0) = 1. 
Solution. The matrix A of the system is 

2 1 
-4 6 

and has repeated eigenvalue 4. We now make the substitutions 

X{t) = x{t)e-^\ Y{t)=y{t)e-^K 
(More generally this would read X (t) = x (t) e~^*, etc. when A is the repeated eigenvalue.) Then 

X (t) = (i; - 4x) e"'** = (-2x + y) e"^* = -2X + Y, (5.6) 
Y (t) = {y~ Ay) e"^* = (-4a; + 2y) e"''* = -AX + 2Y. (5.7) 
We can see that 2X — Y = (generally X and Y will be proportional) and hence 

2X -Y ^ const. = 2X (0) - y (0) = 2 - 1 = 1. 
Hence, substituting this back into (5.6) and (5.7) 

X = -1 giving X = -f + ci, Y = -2 giving F = -2t + C2. 
As X (0) = Y (0) — 1 then Ci = C2 = 1 and we have our solution: 

X {t) ^ {1 - t) e^\ y {t) ^ {1 - 2t) e^K 

■ 

Example 74 Consider the linear system 

dx dy /X.N 

^=a; + y, dl""^^^' a; (0) = y (0) = 1. 



Solution. The matrix 



-1 1 

has complex eigenvalues 1 ± z. The corresponding eigenvectors are solutions of 

:;_\)(;)-, (_■,;)(:)-■ 

and so we can take 



VI = ^ ) , V2 

The formula (5.5) holds still and we have 
As X (0) = y (0) = 1 then 

A + B = l and i{A-B) = l 
so that A={l-i)/2 and B={l+i) /2. Hence 

X (t) = (iZLilgd+i)* + il±i)e(i-i)t = I {(e^* + e"^*) - i (e^* - e"^*)} = e* (cost + sint) , 



and similarly we find 

y(t) = li±lle(i+^)* + tllllei^-^)* = I {(e'* + e"^*) + i (e'* - e"**)} = e* (cost - sint) . 
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5.3 Epilogue — approximations* 



To end this chapter here are three examples where, perhaps in the face of not being able to solve the differ- 
ential equations exactly, it is still possible to solve similar differential equations which approximate the more 
complicated original problem. This material is not expressly on the syllabus; at the same time it would give a 
false impression of the study of ordinary differential equations to discuss only examples which could be solved 
exactly. 

Example 75 A mass P swinging on the end of a light rod is governed by the differential equation 

g . . 

dt^=-r^"^- 

Suppose that the maximum swing of the mass is a. Show that the time T of an oscillation equals 



T = 4 




2^ Jo \/cos9 — cos a 



Solution. See Exercise 59. ■ 

Normally of course we make the assumption that the pendulum moves through small oscillations and ap- 
proximate sin ^ » to get the standard SHM equation, whose solutions have a period of 



Ta = 27r4 



Graphically we can see how the two compare 




1 1.2 1.4 



The approximation remains within a 1% error, that is T/T4 < 1.01, for 6 < 0.398 radians « 22.8°, and within 
a 5% error, that is T/Ta < 1.05, for 6 < 0.874 radians w 50°. 



Definition 76 (Euler's Method) This method suggests that an approximate solution to the initial-value prob- 
lem 

^ = / (a;, y) , y (a) = yo 

can be found using the iteration 

Vn+i =yn + hf{a + nh, yn) 

where y^ = y{a-\- kh) and h is the length of increment in x. The method works on the assumption that y will 
continue to grow on the range nh < x < {n-\-l)h at roughly the same rate as it was growing atx = a-\-nh, y = yn- 
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The smaller the h, the better the chance that this is a reasonable assumption. 



4 










3.5 










3 










2.5 








..^^ ^^^^ V R 


2 








. y4 


1.5 






Y2 




1« 

0.5 


YO 


yi 










Xj 








0.2 


0.4 0.6 


0.8 1 


Solving y' = 


X + y with yo = 


1 andh = 0.2 



Example 77 i^mrf, using Euler's method, an approximation to y {X) where y satisfies the initial-value problem 

dy 



dx 



y + x, y(0) = l. 



Use an increment of X/N and show that y^ — *■ y{X) as N ^ oo. What value of N do we need to get y (1) to 
within 5%? to within 1%? 

Solution. Using integrating factors we can solve the equation exactly to get 

y (x) = 2e^ - 1 - a;. 

The iteration for Euler's method is 

Vn+i =yn + h{nh + yn) = (1 + ft.) j/„ + h'^n, y (0) = 1, 

where h = X/N. Now, either by induction or with knowledge of difference equations, we can solve this recursion 
to get 

yn = 2{l + hf-hn-l. 

Setting h = X/N and n = iV we find 



J/jv = 2 1 + 



X 



N 



n; 

It is a well-known limit, from analysis, that (1 + X/N)^ as N —* oo and so y^ does indeed tend to y {X) 

as AT — > oo. 

To answer the second part, we note with X = 1 that 



1 



N 



?/jv = 2 ( 1 + - ) - 2, whilst y (1) = 2e - 2. 



With a calculator or computer we can find that 

Vn 

y(i) 

VN 

2/(1) 



> 0.95 for N ^ 15; 

> 0.99 for N ^ 79. 
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Example 78 (Critical Points and Stability) The Volterra-Lotka equations are 

dx dy , , 

— = -mx + axy, — =by- kxy, 

where a,b,k,m are positive constants, and they model competing numbers of predators x and prey y. The 
critical points {x,y) are points where the populations x and y remain constant, i.e. where x = y = 0. Solving 
these equations we see that the two critical points are 

(0,0) and {b/k,m/a). 

What would happen though if the steady populations of x = b/k,y = m/a were disturbed a little, e.g. some few 
extra predators or prey introduced? 

Solution. To see this we make a change of variable X = x — b/k, Y = y — m/a. The Volterra-Lotka equations 
then become 



dX 

~dt 



( b\ ( b\ f m\ abY 

-dF = 'V+^)-'['' + -k)V+^)--—'''''^- 

But X and Y represent only the small changes in populations, and so the second order term XY will be smaller 
still and negligible for small enough pertubations. So we can approximate the equations to 

dX {ab\ ^ dF _ / -fcm \ ^ 



dt \ k J ' dt 

The equations 



dX fab\ ^ dF _ f ~km ^ ^ 



dt \ k J dt \ d 

form a linear system of equations like the ones we studied in the previous section. We could solve them using 
similar methods, but that is really unnecessary here as we can note 



d^X fab\ dY fab\ f^km 



dt^ \k J dt \k J \ a 

d^Y / —km\ dX f —km\ ( ab 



X = -bmX, 
Y = -bmY, 



df^ \ a J dt \ a / \ k 

which are SHM equations and X and Y are given by 

X = ai cos (^Vbmt + si^ , Y ~ a2 cos (^Vbmt + , (5.8) 

for constants ai,a2,£i,£2- So the critical point {b/k, m/a) is stable, meaning that if we perturb the populations 
from their steady states they remain roughly the same. In fact, from equations (5.8) we can see that the 
populations move around the critical point in a circular/elliptical motion. Unsurprisingly this type of critical 
point is called a circular one. ■ 



5.4 Exercises 



Exercise 87 Find the eigenvalues and two independent eigenvectors vi,V2 of the matrix 

Put the vectors vi, V2 as the columns of a 2x2 matrix P. Find P^^ and verify by direct calculation that P^^AP 
is a diagonal matrix. 
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Exercise 88 * Consider the following matrices, A, B, C, D. Where it makes sense to do so, calculate their 
sums and products. 



Exercise 89 * Let 




Calculate the products AB, BA, CA, BC. 
Exercise 90 Let 



--{tl). --{-l-l}- (-») 

Show that AB = 0, but that BA 7^ 0. 

Exercise 91 * Let A and B be invertible n x n matrices. Show that {AB)~^ = B~^A~^. Give an example to 
show that {A + B)^ ^ A^^ + B^^ in general. 

Exercise 92 * Determine whether the following matrices are invertible and find those inverses that do exist. 

2 3 

3 4 

Exercise 93 * Calculate A~^ when A is 





1 






1 




2 







i 


2 


i 


1 


3 






1 





1 


-3 





1 


-2 


4 


2 


-5 


5 



Exercise 94 Find the general solution of the system 

dx , dy 

-^=x + 4y, ^-x + y. 

Find the solution {x {t) , y {t)) to the above system for which x{0) =0 and y (0) = 2. Sketch this curve in the 
xy -plane, and sketch in your eigenvectors vi,V2. 

Exercise 95 Find the solution of the system 

^ = -3x + V2y, ^ = V2x-2y, 
dt dt 

when x{0) = £1, y (0) = £2- Is the system's critical point (0, 0) stable? 



Exercise 96 Find the eigenvalues of, and two independent eigenvectors for, the matrix 



2 3 
1 4 



^i) Find the general solution of the system 



Solve 



dx ^ „ dy 

— =2x + 3y, -£=x + 4y. 

dt dt 



^=2x + 3y + 2e^\ § = a; + 4y + 36^* 
dt dt 

2 7 fr\\ — 1 



subject to the initial conditions x{0) = and y (0) = |. 
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Exercise 97 Find the general solution of the simultaneous equations 

- + 2x-2y = t, ^-3. + , = e*. 

Exercise 98 A 2 x 2 matrix A is diagonalizable and has repeated eigenvalue X. Show that A = XI2. 

Give an example of a 3 x 3 matrix A which is diagonalizable and which has a repeated eigenvalue but which 

isn't a multiple of L^. 

Exercise 99 * Let 




Use formula (5.1) to find the eigenvalues of A, i. e. the roots of det {A — xl) . Find an eigenvector corresponding 
to each eigenvalue. 

Exercise 100 * Find the general solution of the simultaneous differential equations 

dx ^ dw d2; „ „ , 

— =f)X-y-z, —=x + 3y + z, — = -2x + 2y + Az. 

Exercise 101 * Solve the following simultaneous differential equations for x, y, z as functions of t given that 
X = —2, y = 2,z = 3 when t = and 

x' {t)=x-2y + 2z, y' (t) = a; + y, z' (t) = 2y-z. 

Exercise 102 * Show that 6 is a repeated eigenvalue of 




and find two independent 6-eigenvectors. Hence find an invertible matrix P such that P ^AP is diagonal. 
Exercise 103 * Find the general solution of the simultaneous differential equations 

da; „ „ dw „ „ dz „ „ 

— =%x + y + 2z, -^ = 7y + 2z, — = -2y + 2z. 

Exercise 104 * Find a 2 x 2 matrix P with complex entries such that 

p-iCP=( \ 0. 

\ u —I 

where C is as given in Example 62. 

Exercise 105 * Find the solution y {x) of the initial-value problem 

^ = X + 2y, y (0) = 0. 
A numerical approximation to y{x) is calculated using Euler's method: 

~ rih^ - = Xn ~t~ 2y^. 

h, 

Show that there is a solution of the form 

yn=A{l + 2hf +Bn'^ + Cn + D 

to the above recurrence relation which satisfies yo = 0, and confirm that, as h —> 0, the numerical approximation 
converges to the solution y {x) of the initial-value problem. 
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Exercise 106 * Show that Xn = Aa'^ + B^^ is the general solution of the recurrence relation 

Xn+2 ~{a + /3) Xn+i + oij3x„ =0 (n ^ 0) 
where a ^ j3. Find A and B in terms of yo and yi. 
Exercise 107 * Solve the following recurrence relations 

(i) Xn+2 - ix„+i + 2xn = 0; 

(ii) Xn+2 + 5Xn+l + QXn = Xq = Xi = 1; 

(Hi) Xn+2 + '^Xn+l + 2x„ = Xq = Xi = 1. 

Present your answer to part (Hi) so that it is clear the solution is a real sequence. 
Exercise 108 * Solve the following recurrence relations 

(i) a;„+2 - 3a;„+i + 2a;„ = n; 

(ii) Xn+2 + 5Xn+l + 6Xn = Xq = Xi = I. 

Exercise 109 * (a) List the six bijections from the set {1,2,3} to itself. 

(b) Every such bijection can be produced by repeatedly swapping just two elements at a time. Find two 
different ways of producing the bijection 1 ^ 2, 2 ^ 3, 3 ^ 1 as a composition of swaps. 

(c) It is a theorem from the Hilary term algebra course that, for a given bijection, the number of swaps will 
always be even or always be odd; so these bijections are said to be even or odd. List the even bijections from 
{1,2,3} to itself 

(d) Show that formula (5.1) agrees with the below formula 

^Sgn (/) 01/(1)02/(2)03/(3) 

/ 

where the sum is taken over all bijections f from {1, 2, 3} to itself and sgn (/) = 1 or —1 depending on whether 
f is even or odd. 

(e) The general nxn determinant formula is given by a similar expression summed over all bijections from 
{1, 2, . . . , n}. How many such bijections are there? 



Exercise 110 * Let 



A={ I I) and B=(l J 



Find all those 2x2 matrices X which commute with A — i.e. those satisfying XA = AX. 

Find all those 2x2 matrices Y which commute with B — i.e. those satisfying YB = BY. 

Hence show that the only 2x2 matrices, which commute with all other 2x2 matrices, are scalar multiples 
of the identity matrix. [This result holds generally for nxn matrices.] 



Exercise 111 * Let A denote the 2x2 matrix 



a b 
c d 



Show that 

- (traceA)A + (det A)l = (5.11) 

where traceA = a + d is the trace of A, that is the sum of the diagonal elements, det A = ad — be is the 

determinant of A, and I is the 2x2 identity matrix. 

Suppose now that A" = for some n ^ 2. Prove that det A — 0. Deduce using equation (5.11) that = 0. 



Exercise 112 * Let 

A = 

Show that 



5 -1 
4 1 



2n + 3 — n 

4n 3 - 2n 



for n = 1, 2, 3, Can you find a matrix B such that B"^ = A? 
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6. PARTIAL DIFFERENTIATION 



We began our study of ordinary differential equations (ODEs) by modelling the flight of a projectile. Let's 
consider here, before discussing particd derivatives and particd differential equations (PDEs), a slightly 
more complex example of a mug of tea cooling down on a table top. 

In what sense is this new physical scenario more complicated to model? The quantity we are naturally interested 
in here is the temperature T of the liquid. In our earlier motivating example of the projectile, its height depended 
only on time t. Let's take the mug's dimensions as radius R and height H. Here the temperature of the tea will 
again depend on time t, but almost certainly not in a uniform way throughout the mug. So rather than just 
depending on time t the temperature will also depend on the spatial co-ordinates x, y and z. Also heat will be 
lost to the air, down the sides of the mug, to the air beyond and may heat up the table and surrounding air. 

The differential equation governing the behaviour of T{x,y,z,t) is a partial differential equation. The heat 
equation, (also more generally referred to as the diffusion equation), states that 

dT _ /d^T d'^T d'^T\ 
dt \ dx^ dy'^ dz^ J 

where k is known as thermal diffusivity. These "fancy" derivatives with the curly ds just reflect that T depends 
on several variables. 

• The partial derivative dT/dt is the derivative of T with respect to time t when we keep all the other 
variables x, y, z constant. 

• Again dT/dt is a function of the four variables x, y, z, t. 

• It is a measure of what we'd see if we focus separately on each point {x,y,z) and watch how the temperature 
changes over time. Because the tea is cooling we would expect that dT/dt < throughout the mug and 
at all times. 

What other information would we need to describe the tea's cooling? Well, we would again need an initial 
condition saying how hot the tea was to begin. So we might assume it to begin uniformly hot and have 

Tix,y,z,0)=To 

for some initial temperature Tq and for x^ + y^ < , < z < H. 

We would also need to make some assumptions about how the heat dissipated out of the mug. If we assume 
that air remains constantly at some ambient room temperature Ta then one boundary condition would be 

T{x,y,H,t)=TA 

for x^ + y'^ < R"^, t > 0; this describes the temperature's behaviour at the top of the mug (though it isn't a 
particularly realistic assumption physically!). If say the mug were insulated to allow no heat loss then another 
boundary condition would be 

dT , 

— x,?y,0,t =0 
dz 

for + < i?^, t > at the base of the mug. Likewise down the sides of the mug we would have 

— {x,y,z,t)=^ 

foic x"^ + y^ = R^ , < z < H, t > and where r denotes the distance of a point from the central axis of the 
mug. 

Indeed, having noticed this symmetry in the cooling we might decide to change variables and consider this as 
a problem in just three variables r, t, z reducing the problem somewhat. 
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6.1 Partial Derivatives 



Definition 79 Let f : W 



be a function of n variables Xi,X2, ■ ■ ■ ,Xn. Then the partial derivative 
df 



dx, 



{pi,...,Pn) 



is the rate of change of /, at (pi, . . . ,Pn) , when we vary only the variable Xi about pi and keep all of the other 
variables constant. Precisely then 

df , . f {pi,...,Pi-i,Pi + h,Pi+i,...Pn)- fipi,...,Pn) 
— ipi, . . . ,pn) = lim . 

OXi h-yO h 

By contrast, derivatives such as df/dx are sometimes referred to as full derivatives. 

• If / (x) is a function of a single variable note that 

da; dx 

• df/dx is is still pronounced "d f by d x" or sometimes "partial d f by d x". 

• We shall also, occasionally, write fx for df/dx, which is equally common notation. 

• In some texts df /dxi is denoted as /j. 



Notation 80 



1 

3 
Z 
1 

,6. -5 



I ■ 
I 

\ & t 

I — in tliis- aiEectiQQ 
I a Y ■ ■ ■ 



tPi,P£) I d f 



— — in^'Chtis direction 
' d X 
[ ■ 
\. ■ 
t 
L 



^(a,5) = lim^^i^±MW(M) 

dx h^Q 



|^(a,6) = lim:^M±M-^ 

dy k^a 



h dy fe^o k 

• To stress which variables are being kept constant some texts use the notation 

dl 
dx 

to denote that this is the partial derivative of / with respect to x whilst keeping y constant. This is a 
measure of how quickly / (x, y) is changing as we move from the point (pi,P2) along the line y = P2. We 
will however always may clear which co-ordinate system we are using and so a partial derivative will 
always denote differentiation with respect to some co-ordinate and all other co-ordinates in the system 
being kept constant. 



Example 81 Let 



Then 



f (x, y, z)=x'^ + ye"" 



dl 
dx 



= 2x + ye'', 



^ = - - 
dy y2 



dl^l 
dz y' 
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Example 82 Let 

Then 



f {xi,X2,X3,X4) = sin(a;ia;2) + cos 0:4. 



- — = X2Cos(a;iX2) , ^ — = cos (a;ia;2) , ^ — = 0, 
0x1 0x2 0x3 

Note that df/dx^ = as the definition of f is independent 0/2:3. 



dl 
dxi 



= — sina;4. 



• Note more generally that df/dx^ = despite / being non-constant; / needs only to be constant as we 
vary x^ and so the general solution of the equation df/dx^ = is 

f {xi,X2,X3,X4) = g{xi,X2,X4) 

for an arbitrary differentiable function g involving the other variables. 

Example 83 Let f iM.'^ be defined by 

f(x v) = ( ifx^O ory^O, 

■' ^ ]^ 1 otherwise 

Note that the partial derivatives df/dx and df/dy exist at (0,0) with 



1^(0,0) = H^/(/^.0)-/(0.0)^H^^ 
dx h^o h h^o h 

fiO,0) = lin. = lin. ^ 

dy h^o h h^o h 



yet f is not continuous at (0, 0) . This contrasts with full derivatives where differentiability implies continuity 
(see Hilary term analysis course). 

Definition 84 We may define second and higher partial derivatives in a similar manner to how we define them 
for full derivatives. So, in the case of second partial derivatives of a function f{x,y): 



also written f^x = > 
also written fyx = {fy)^ , 
also written f^y = {fx)y , 
also written fyy = {fy)y . 





d 


(df 


dx"^ 


dx 


[dx 


dV 


d 


(dl 


dxdy 


dx 


\dy 


d'f 




fdf 


dydx 


dy ' 


Vax, 






(df 




dy ' 


{dy. 



Example 85 Let us return to the function 



f{x,y,z)=x'^+ye=' + 



from Example 81. Then 



So 



dl 
dy 



z 



di^i 

dz y' 



9V 
dx-^ 

dV 

dxdy 

d\f 

dxdz 



= 2 + ye^ 



= 0, 



d'f 




d\f 


dydx 




dzdx 


d^ 


2z 


d'f 


dy^ 


y6 


dzdy 


d\f 


-1 


92/ 


dydz 


yJ 


dz^ 



= 0. 
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Note in the previous example 



be such that and exist and are continuous. Then 



dydx dxdy^ dzdx dxdz^ dzdy dydz' 
This will typically be the case in the examples we'll see, as the following theorem shows: 

Theorem 86 Let f ■.'M? - 

dydx dxdy ' 

Proof. (For reference only — requires Hilary term analysis, and non-examinable) . For x, y, h,k €M. define 
</> {x, y) = f {x,y + k) - f {x, y) , and ip {x, y) = f {x + h,y) - f {x, y) 

so that 

D{x,y) = f {x + h,y + k) - f {x + h,y) - f {x + h,y) + f {x,y) 
= 4){x + h,y)-4){x,y) 
= tPix,y + k)-ipix,y). 

By the Mean- Value Theorem, applied twice, there exist f?i,02 G (0, 1) such that 

D {x, y) = 4){x + h,y) - 4) (x, y) = h(j)-,. {x + Oih, y) 
= h%{x + eih,y + k)- U{x + 0ih,y)] 
= hkfooyix + 9ih,y + 92k). 

Arguing similarly with from the D {x, y) = tp {x,y + k) — tp {x, y) expression we know there exist ^3, 04 € (0, 1) 
such that 

D {x, y) = hkfyx {x + O^h, y + Oik) . 

So 

fxy {x + Oih, y + 6*2^:) = fyx {x + 6*3/1, y + O^k) 
Letting h and k tend to 0, and using the continuity of fxy and fyx we see that fxy — fyx as required. ■ 

Example 87 Define f : 



. by 

fix,y) 



Show that 



'ityf^ if ix^y)^ {0,0), 
if {x,y) = {0,0) . 



d'f ^ d^f 
and 



dydx dxdy 



exist yet are unequal at (0, 0) . 
Solution. We have for a; 7^ and y ^ 0, 



dl 
dx 



(o,y) 



[x^ + y- !j [.ix- - y- - xy 



and similarly 



Hence 



{x^ + y^) 

(x^ + y^) X {x? - 3y^) - xy {x? - y^) 2y 



x=0 



; X. 
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Example 88 Find all solutions of the form f {x, y) to the partial differential equations 

= 0, M -^=0. 



dydx ' dx'^ 
Solution. The first PDE is 

dy ^dx) ^' 

Those functions g {x, y) which satisfy dg/dy = are functions p {x) which solely depend on x. So we have 

df . . 

This looks like an equation we would normally just integrate up, not forgetting a constant. But again d/dx 
sends to zero any function Q [y) of y. So instead of a constant we have an arbitrary function of y. The solution 
then is 

f{x,y)=P{x) + Q{y) 

where P [x) is an anti-derivative of p [x) , i.e. P' (x) — p {x) . 

For the second equation S^f/dx^ = we can integrate in similar fashion to get df jdx = p {y) and then 

/ (x, y)=p{y)x + q (y) 

where p (y) and q (y) are functions just of y. m 

Remark 89 Note how the solutions to these second order PDEs include two arbitrary functions rather than 
two arbitrary constants as is often the case of an ODE. This makes sense though when we note that partially 
differentiating wrt x annihilates functions solely in the variable y and not just constant functions. 



6.2 Co-ordinate Systems 



In the examples we have seen so far we have considered functions f {x,y) or g{x,y,z) where we have been 
thinking of x, y, z as Cartesian co-ordinates. / and g are then functions defined on a 2-dimensional plane or in 
3-dimensional space. There are other natural ways to place co-ordinates on a plane or space. Indeed, depending 
on the nature of a problem and any inherent symmetry, it may be very natural to use other co-ordinates. 

In the next section we will derive the chain rule which describes the relationship between derivatives in one 
system and another. For now, we simply introduce some important examples of co-ordinate systems. 

Example 90 (Planar Polar Co-ordinates) 

Instead of considering a point of a plane as so much along each of two perpendicular axes, we can equally 
describe a point P as being at a certain distance r from an origin O and such that OP makes an anti- clockwise 
angle 6 with a fixed axis, usually the positive x-axis. This is shown in the diagram below. 

j 




1 £ S 4 5 
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We can see that the equations relating Cartesian co-ordinates and planar polar co-ordinates are 



X = r cos y, y — rsmi) 

r = a/x^ + 2/^, ta,n9 = y/x. 

So in the above diagram where x = 3 and y = A then 

r = -\/32 + 42 = 5 and 8 = tan~^ ^ « 1.249 radians. 

Note that r takes values in the range r € [0, 00) and 6 e [0,27r) ,/or example, or equally (— tt, tt]. Note also that 
6 is undefined at the origin. 

Remark 91 Note the definition of partial derivative df /dx very mucli depends on the co-ordinate system that 
X is part of. It is important to know which other co-ordinates are being kept fixed. For example, we could have 
two difTerent co-ordinate systems, one the standard Cartesian co-ordinates and the other being x and the polar 
co-ordinate 9. Consider now what dr/dx means in each system. In Cartesian co-ordinates, we have 

r = \/x^~+y^ and so dr/dx = a;/-\/a?M-~y^. 
However when we write r in terms of x and 6 we have 



x/ cos9 and so — 



dr 1 y^a;^ + 



dx cos 9 X 

which is certainly a different answer! The reason is that the two derivatives we have calculated are 

and so are measuring the change in x along curves y ~ const, or along 9 = const, which are very different 
directions. Indeed note that the two are equal only when cos^ 9 = 1 in which case lines of constant 9 and 
constant y are in the same direction. 

Example 92 (Changing from, polar to Cartesian co-ordinates and vice versa) 

Given a curve with equation f {x, y) =0 then it can be rephrased as an equation in polar co-ordinates as 

g (r, 9) = f {r cos 9, r sin 9) = 0. 

The unit circle x? + = 1 clearly becomes r = 1 and the line x = k becomes r = k sec 9. 

More generally, the line ax + by + c = becomes r {a cos 9 + bsinff) = —c which can be rewritten as 

r = A sec {9 — a) 

where A = — c/V + 6^ and tana = b/a. 

In reverse, given a curve in polar co-ordinates F (r, 9) = 0, then this can be rewritten as 



G{x,y) = F (v'a;2 + y2,taii-i (|)) = 0. 



For example, r — cos9 becomes r^ = rcos9 which gives x^ + y'^ — x. This we see is the circle 

which is a circle of radius 1/2 around the point (1/2,0) . Note the circle is mapped out as 9 varies from —it/2 
to 7r/2. ^45 6 varies from 7r/2 to 37r/2 then cos9 is negative — when r is negative then {r,9) is plotted in the 
opposite quadrant to (— r, 9) reflected through the origin — and so in this case as 9 varies from ■jt/2 to 3n/2 the 
point (r, 9) retraces the circle once more. 
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Example 93 (Planar Parabolic Co-ordinates) 

Planar Parabolic co-ordinates u, v are given in terms of x and y by the relations 



a; = - (w -V ) 



y — uv. 



Note that the curve u ^ c, in Cartesian co-ordinates, is 2xc^ — c^ — y'^ and the curve v ~ k, in Cartesian 
co-ordinates is 2xk^ — y'^ — both of which are parabolas. As u and v vary over the positive numbers then 
{x,y) varies over the upper half-plane. 




Example 94 (Cylindrical Polar Co-ordinates) 

We can naturally extend planar polar co-ordinates into three dimensions using a further z co-ordinate. In 
this case they are called cylindrical polar co-ordinates. The relationships between r, 4>, z and x, y, z are given by 



x — r cos ( 



and 



r = ^/x^ + j/^ 



y = r sm i 

tan (j) ~ 



Note that r — const, defines a cylinder, (f> — const, defines a vertical plane through the origin and z — const, 
defines a horizontal plane. 




(n 0, z) 



Cylindrical Polars 



z 








X 





Spherical Polars 



Example 95 (Spherical Polar Co-ordinates) 

In the same way that latitude and longitude are used to determine a position on the planet, we can similarly 
use two angles 9 and (f> to determine position on concentric .spheres distance r from the origin. These are called 
spherical polar co-ordinates. 

Here r is simply the distance of a point P from the origin O. The angle 9 is the angle OP makes with the 
vertical z-axis and takes values in the range — 7r/2 ^ ^ 7r/2. Finally, if Q is the projection of P vertically into 
the xy-plane then (j) is the angle OQ makes with the positive x-axis. The relationships between x, y, z and r, (j), 9 
are given by 

X — r sin 9 cos <j), y — t sin 9 sin 4>, z = r cos 9, 

and 

X 



r — i/a;-^ + + 



tan ( 



1^2 _1_ y2 
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6.3 Exercises 



Exercise 113 Let f {x,y) = exp(y/x) where x ^ 0. Find the partial derivatives fx, fy, fxx, fxy, fyx, fy 
verifying that f^y = fyx- 



+ z^) for (x, y, z) (0, 0, 0) , then verify that f^x + fyy + fzz = 0. 



Exercise 114 If f {x,y, z) 
Exercise 115 Let F (x,y) = f [x/y) where f is a differentiable function of one variable. Show that 



^dF _ Q 

dx ^ dy 

Exercise 116 Let F {x, y) = f (y Inx) where f is a differentiable function of one variable. Show that 



dF dF d'^F 2, d'^F 

X— + y— =xy^-^ X Inx- 

ox ay oxoy 



dx"^ 



Exercise 117 Let F [x, t) = f {x — ct) + g {x + ct) where f and g are differentiable functions of one variable 
and c is a constant. Show that c^F^x = Fu- 

Exercise 118 If x'^/a'^ + y'^/b'^ = 1 find dy/dx and d'^y/dx'^ by implicit differentiation. 
Exercise 119 Find the point on the curve x"^ + xy + y'^ = I which is closest to the line x + y = 2. 
Exercise 120 By changing to polar co-ordinates, or otherwise, sketch the curve (x^ +y^)^ = x'^- 
Exercise 121 Sketch the logarithmic spiral r = ae^^ where a, b are positive constants. 

Show that the angles between the tangent to the curve and the radius from (0, 0) to (r, 6) , is always constant. 
Show further that the curve's arc-length from the origin (as 6 —> —oo) to the point (r, 0) is finite and determine 
this arc length in terms of a and b. 

Exercise 122 Polar co-ordinates r and 6 are defined for a;>0, yeM by r = ^x^ -\- y^, and 9 = tan~^ iv/x) ■ 
(i) Sketch the curves r = const, and 9 = const. At what points is dr/dy positive? At what points is dy/dr 

positive? 

(iA)Find%, f, If, f, Ip, §, %. Verify that %%<\ at all points. 

Exercise 123 The variables u and v are defined in terms of x and y > by 



u = \/x'^ + y'^ + X, 



v 



+ y2 



X. 



(i) Sketch the following curves u = i and v = i for i — 1,2, 3. [N.B. remember that y > 0.] From your diagram 
only, explain why dx / du > and dy/du > at every point. 

(ii) Write X and y as functions of u and v. Find ^, ^, ||, ||, |f, |f. 

Exercise 124 Show that the function f {x,y) = {Ax"' -\- Bx~"') {C cos ny + D sin ny) , where A,B,C,D,n are 
real constants and n is non-zero, satisfies the partial differential equation 

92/ 1 df 1 „ 

— - H -\ = 0. 

dx"^ x dx dy"^ 

Exercise 125 What functions f {x, y, z) satisfy the following partial differential equations? 



(n) 



= 0. 



dxdydz 

Exercise 126 What functions f {x, y) satisfy the following partial differential equations? 



df d'^f df 



= 0. 
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Exercise 127 What functions f {x,y) satisfy the following partial differential equations'! 

What value does your solution to the second equation take on the lines y = ±1? 
Exercise 128 Let x gM. and t > 0. Verify that 

1 {x, t) = —= exp 



^/t "^yAnt 

is a solution of the heat equation 



dt dx^ ' 

Sketch T as a function of x at two different times t. Show that 



dT d^T 

^l^- (6-1) 



/oo 
T{x,t) 
-oo 



dx 



is a constant function oft. 
Exercise 129 Show that 



e-' ds 



T{x,t)= r 

Jo 

satisfies the heat equation (6.1). 
Exercise 130 Verify, for y > 0, that 



Exercise 131 * Let 



i{y) = / 

where y > 0. Given that 



sin a; , 
e da; 

X 



d / ^_^y sinx ^\ _ d ( sina; 
dy Vo a; ) Jo dy 



calculate I' {y). What is the limit limy_>oo I {y) ? Deduce that 



I{y) = \- tan ^ y. 



Exercise 132 * Given that 

d_ 
da 

show that 



( r -^^ a\ r 9 ( \ A 

I J e cos ax dx J = J q~ \^ cos ax j dx, 

r -X- A (-°'^\ 

I e cos ax dx = exp — — . 

Jo 2 4 y 



Exercise 133 * The Legendre polynomials Pn{x) (see Exercises 85 and 86) can he defined by means of a 
generating series; to be precise we can define them by the equation 



-J oo 

, =y^Pn{x)r, \t\<l. 



n=0 

By differentiating partially with respect to t, or otherwise, show that 

(n + 1) P„ (a;) = (2n + 1) a;P„ (a;) - nP„_i (a;) . 
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7. CHANGE OF VARIABLE. CHAIN RULE 



7.1 Functions of Two Variables 



Functions of more than one variable are common throughout mathematics. The motivating example for partial 
derivatives at the start of the last chapter involved a temperature T which depended on three spatial co-ordinates 
I, y, z and one temporal co-ordinate t. Such functions are often associated with maps as well. For example, we 
might have a physical map denoting the height z above a point {x, y) . The map might also include contours 
which are the curves z = c of constant height. Here 2 is a scalar function of two variables. 




- S[A LEVEl- 
CONTOUR INTEKVAL, IOC M 




Or the map might be a meteorological map denoting the wind speed and direction (at a fixed height) above a 
point (x, y) . Above is a wind-direction field associated with a hurricane — with each point (x, y) is associated 
a vector u (a;, y). That is, u is a vector-valued function of two variables. 

Example 96 The height z of a col above the point {x, y) is given by the function 

f{x,y) = 10-{x-lf+y^. 

Write this as a function g{r,6) of planar polar co-ordinates. 

Solution. As X = rcosO and y = rsinff then we can write 

f {x,y) = 10 — (r cos6' — 1)^ + (rsin^)'^ 

= 10-r^cos2 6l + 2rcos6'- l-t-r^sin^e* 

= 9-r^cos26'-h2rcos6' 

= 9{r,e). 



Remark 97 Note that / and g are different functions, even though z — f {x,y) — g (r, 
that z = f {r,9) , rather z is given by a different rule in terms of r and 6. The rule 

z^f{r,9)^ lO-{r-lf+0^ 



It is not the case 



is clearly not the right one! 
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7.2 The Chain Rule 



The chain rule for two or more variables serves the same purpose as the chain rule for one variable. Recall that 
the one-variable chain rule states that 

d/ d/ du 
dx du dx ' 

The rule arises when we wish to calculate the derivative of the composition of two functions f {u{x)) with 
respect to x. 

Likewise we might have a function / {u, v) of two variables u and v, each of which are functions of variables x 
and y. We can then make the composition F 

F (a;, y) = f{u {x, y) , v {x, y)) , 

which is itself a function of x and y. We might then wish to calculate its partial derivatives 

dF , OF 
— and — . 
ox ay 

The chain rule states that 

dF df du df dv 

dx du dx dv dx ' 

dF df du df dv 

dy du dy dv dy 

Before going on to prove the chain rule, here is an example approached two different ways. 
Example 98 Let 

f {u,v) = {u — v)smu + e'" , u{x,y) = x^ +y, v{x)=y — 2x, 
and let F {x, y) = f {u {x, y) , v {x, y)) . Calculate dF/dx and BF/dy by (i) direct calculation, (ii) the chain rule. 
Solution, (i) We have that 

F {x, y) = {x^ + 2a;) sin {x^ +y) + exp (y — 2x) . 



Hence 



= {2x + 2) sin [x^ +y)+2x [x^ + 2x) cos [x^ +y)-2 exp [y - 2x) ; 



dx 

— = (x^ + 2x) cos (x^ +y) + exp {y - 2x) . 



(ii) Using the chain rule we have 



BF _ df du ^ df dv 

dx du dx dv dx 



(sinw + (u — f ) cosu) 2x + (— sinu + e") (—2) 

{2x + 2) sin (x^ +y)+2x ix^ + 2x) cos ix^ +y) -2 exp {y - 2x) ; 



OF _ df du dfdv 
dy du dy dv dy 



(sin u+ (u — 7;) cos u) (1) + (— sin u + e'") (1) 
(x^ + 2x) cos (a;^ +y) + exp (y — 2a;) . 
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Theorem 99 (Chain Rule) Let F {t) — f {u{t) ,v{t)) with u and v differentiable and f being continuously 
differentiable in each variable. Then 

dF _ dldu_^dldv 
dt du dt dv dt 

Proof. (Not examinable) If we change t to t + 5t, let 5u and 5v be the corresponding changes in u and v. Then 



5u = + St, and Sv = + £2 ] St, 



where £1, £2 — > as 5f ^ 0. Now 



SF = f {u + Su,v + Sv) — f (u, v) 

= [f {u + Su, V + Sv) — f {u,v + Sv)] + [f {u, V + Sv) — f {u, v)] 



By the Mean-value Theorem (Hilary term Analysis) we have 

df 

f {u + Su,v + Sv) — f {u,v + Sv) = Su— {u + 9-i_Su,v + Sv) . 

df 

f {u,v + Sv) - f {u,v) = Sv— {u,v + 92Sv) , 
for some 61,62 € (0, 1). By the continuity of /„ and fy then we have 

Su^ (u + 6iSu,v + Sv) — Su ( ^ (u,v) + r]i 
ou \au 

Sv^{u,v + e2Sv) = Sv(^^{u,v)+r]2 

where r/^, r/2 ^ as Su, Sv 0. So, putting this all together 

SF Su (df ^ ^ \ Sv (df , , 

"re^ i%{u,v) + r)\^(%+e^ i%\u,v) ^2] • 



At V V \di ) \dv 

Letting St ^ we get the required result. ■ 

Corollary 100 Let F {x, y) = f {u {x, y) , v {x, y)) with u, v differentiable in each variable and f being continu- 
ously differentiable in each. Then 

OF _ df du ^ df dv dF _ df du ^ df dv 
dx du dx dv dx ' dy du dy dv dy 

Proof. This follows easily from the previous theorem by treating first x and then y as constants when differ- 
entiating. ■ 

Example 101 A particle P {x, y, z) moves in three dimensional space on a helix so that at time t 

x{t)=cost, y{t)=smt, z{t)=t. 

The temperature T at {x,y,z) equals xy + yz + zx. Use (i) direct calculation, (ii) the chain rule, to calculate 
dT/dt. 

Solution, (i) 

T{t) = x (t) y{t)+y (t) z{t) + z (t) x (t) = costsint + tsint + tcost. 

So 

dT .00. 

— - = — sm t + cos t + smt + cos t + t cos t — tsmt. 
dt 
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(ii) Alternatively the chain rule says that 



dT _ dTdx dTdy dT dz 

dt dx dt dy dt dz dt 

= (y + z) (— sin t) + {x + z) cos t + {x + y) 

= {t + sin t) {— sin t) + {t + cos t) cos t + (sin t + cos t) 

= — sin^ t + cos^ t + sint + cos t + t cos t — t&m t. 



Example 102 Let z = f {xy) where f is an arbitrary differentiable function in one variable. Show that 

dz dz ^ , , 



Solution. By the chain rule 



Hence 



dz dz 
^=y/'(a:y) and —= xf {xy) . 

dz dz 

- = xyf {xy) - yxf {xy) = 0. 

■ 

Example 103 Conversely show that any solution of (7.1) is of the form z = f {xy). 
Solution. We first make a change of co-ordinates 

u = y/x, V = xy. 

We are aiming to show that 2; is a function solely in t;, or equivalently that dz/du = 0. By the chain rule 

dz dz dx dz dy 
du dx du dy du ' 

Solving for x and y in terms of u and v we have that 



Then 



— and y = y/wv. 

u 



dz —1 nr dz 1 fv dz 

du 2 \J u^ dx 2\ udy 

— Ix^dz 1 dz 
2 y dx 2 dy 

— Ixfdz dz\ ^ 
2 y\ dx ^ dy) 

and z — f {v) — f {xy) as required, where / is an arbitrary differentiable function. ■ 

Example 104 A particle P moves around on the unit sphere r = 1. Find P's velocity v {t) = dr/dt in terms 
of the spherical polar co-ordinates </>, 9, and their derivatives and verify by direct calculation that v • r = 0. 

Solution. Recall that 

X = sinO coscf), y = smOsm^, z = cos6, 
where 9 and ^ are functions of t. Now 

X = cos 6 cos ^9 — sin 6 sin </> ^, 
y = cos 6sm(j)9 + sin 6 cos </> ^, 
z = —sm96. 
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So 

V • r = ^cos cos 4>9 — sin 9 sin (p sin cos (f> 
+ ^cos sin ^ ^ + sin 9 cos ^ </» j sin 9 sin (/> 
+ sin 61 e".) cos^ 

= 6 (cos 6 cos^ sin ^ + cos ^ sin^ </> sin ^ — cos ^ sin 6) 
(— sin^ ^ sin (f> cos + sin^ 9 sin ^ cos (j)) 

= 

■ 

This is true for any movement on the sphere. We can prove this much more easily by differentiating the 
vector identity r • r = 1 to get 2v • r = 0. 

To find the particle's acceleration by means of a chain rule we would need the next theorem. 

Theorem 105 (The Second Order Chain Rule) 
Let F {x, y)= f{u {x, y) , v {x, y)) . Then 

L^xx ~ fu'^xx ~l~ fv'^xx ~t~ fuu (^x) ~t~ ^fuv'^x'^x ~t~ fvv (^x) ) 

^xy ~ fu'^xy ~i~ fv^xy ~t~ fuu'^x'^y ~t~ fuvi^y'^x ~t~ '^x'^y) fvv'^x^y) 

^yy fu^yy ~t~ fv^yy ~t~ fuu ("^y) ~t~ ^fuv'^y'^y ~t~ fvv ("^y) • 

Proof. 

-^xx ~ (,fu'^x ~t" fv^x)x 

~ ^fu)x ^fv)x fu'^xx ~t~ fv'^xx 

~ i^fuu'^x ~i~ /uv^x) ^x ~t~ (yfvu'^x ~t~ fvv'^x^ ^x ~i~ fu^xx ~t~ fv'^xx 

~ fu^xx ~t~ fv'^xx ~l~ /uu (^x) ~l~ ^fuv'^x'^x ~t~ /vu ("^x) 

and the other results follow similarly. ■ 

Example 106 -Let u = — y"^ and v = x"^ — y for all x,y gR, and let f {x, y) — x^ +y'^. 

(a) Find the partial derivatives Ux,Uy,Xu,yu in terms of x and y. For which values of x and y are your 
results valid? 

(b) Find /„ and fy. Hence show that fu + fv — ^, and that fuu = fw Evaluate fuu in terms of x and y. 

Solution, (a) Clearly = 2x and Uy = —2y. We can also differentiate the expression for u and v with respect 
to u to get 

l = 2xxu-2yyu, = 2a;a;„ - y„. 
Solving the simultaneous equations in Xu and ?/„ gives 

1 1 



2x(l-2y)' l-2y- 

These expressions are clearly valid when x ^ Q,y ^ 1/2. 
(b) By the chain rule 

f-f Mf 2a; ^ 2y _ l + 2y 

Iu-JxXu + hyu-^^^^_^y^ + ^_2y - l-2y- 

If we calculate Xy and y„ in a similar fashion to the above calculations for Xy, and ?/„ we obtain 

y 1 

'^''-a;(2y-l)' ^^-2j;-l- 
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So, by the chain rule again, 



fv — fx^v ~1~ fyVv — /(-) 1 \ 



x{2y-l) 2y-l 2y - V 
Hence 

_ (l + 2y) + (-4y) _ 
/« + - ^ _ 2y 

Now differentiating the identity fu + fv = ^ separately with respect to u and v gives 

fuu ~t~ = = -\- fyy. 

As = fyu then = /^^ as required. To find fuu explicitly we can use the chain rule again to note 
fuu = Xu + {fu)y Vu = + ^ (t^) 1^ " (l-2yf 



7.3 Exercises 

Exercise 134 Suppose f {x, y) = x"^ + where 

X [t] = tcost and y [t) = tsint. 
Let F (t) — f (x (t) ,y (t)) . Find dF/dt in two ways: (i) directly, (ii) using the chain rule. 
Exercise 135 Use the chain rule to find dF/du and dF/dv when F [u, v) = f {x, y) and 

f{x,y) = y/x, x = u-\-v, y = u-v. 
Exercise 136 Let z{x,y) = (y/x) , where g is a differentiable function of one variable. Show that 

xzx + yzy + z = 0. 

Show conversely that the only solutions of this partial differential equation are of the form z {x, y) = g {x, y) jx. 
Exercise 137 Let f {x,y) = xg (y/x) where g is a differentiable function in one variable. Show that 

X fxX ~t~ ^Xyfxy -\- y fyy — 0. 

Exercise 138 Let u = x + y and v = xy, and let f be a suitably differentiable function in x and y. Express fx 
and fy in terms of /„ and /„. Show further that 

fxy fuu ^" ufuv ^" fvv ^" fv • 

Exercise 139 Suppose that f is a function of x and y, and that y is a function of x. Show that 

^ f \\ 9f , 9f dy 

If y{x) is defined implicitly by the equation f {x, y) = 0, deduce that 

dy 75/ 
da; dx / dy' 



Show further that 

^2 
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Exercise 140 The equation 3y = + 3xz defines z implicitly in terms of x and y. Evaluate z^x, Zxy and Zyy 
in terms of x and y. Verify that z is a solution of 

Exercise 141 Find dz/ds and dz/dt given that 

z = x^ + xy, 

a;2 + y3 = st + 5, 
3 2 2 , .2 

X —y = s +t . 

Exercise 142 Given xs"^ +yt^ = 1 and x'^s + y'^t = xy — 4 find dx/ds, dx/dt, dy/ds, dy/dt at 

{x,y, s,i) = (1,-3,2,-1). 

Exercise 143 Let w = x^ + xy + z^ and 

x'^+x_y^ + y_z^+z 
~ 3 ~ 4 ~ 5 ■ 

Find Aw /At. 

Exercise 144 Let a; = e" cos v and y = e" sin v and let f {x,y) = g {u, v) . Show that 

(^X ^" y ) {fxx ^" fyy) guu ^ 9vv 

Exercise 145 Variables u and v are given in terms of x and y by 

u = x'^—y'^, v = 2xy, 
and let g {u, v) = f {x, y) . Using the chain rule show that 

^ = 2/+4x^-4 + 8xy— ^+ 

ox^ ou au^ auav av^ 

and find a similar expression for f jdy^ . Hence express d"^ f / dx^ + d"^ f / dy'^ in terms of the partial derivatives 
of g. Deduce that if f {x,y) = x + y then 

d^Q cP'q 
— - ^ — - = 0. 

Exercise 146 The function f (x, y) satisfies the equation 

^ dx ^ dy 

Let F (u, v) — f {x, y) where 

u = x^ —y^ and v = 2xy. 

Use the chain rule to find df /dx and df jdy in terms of dF/du and dF/dv. Hence show that dF/dv = and 
that f is, in fact, a function of x"^ — y^ only. 

Exercise 147 Let z {x, y) = f (xy) where f is a twice- differentiable function of one variable. Show that z 

satisfies the equation 

^xx ~t~ y Zyy ~t~ "^Zx ~\~ yZy — 2xy Zxy • 

Exercise 148 // / : M" ^ M has continuous partial derivatives and is homogeneous of degree k, that is 

/(tx) = fV(x) 

for any x G R", prove Euler's equation that 
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Exercise 149 Given that x +y + z =6 and + = 5xy + 12 find 

/dz\ /dz\ /dz\ /dz\ / dw\ f dx^ 

\di)^' \d^)j \di)j \d-y)j yo^)^' \d^, 

at the point [x, y, z, w) = (1, —2, 1, 1) 

Exercise 150 Given f {x, y,z) =0 find expressions for 

dx)^' \dy)^' \9zJ^' \dx 

Show that 

(?^] =1 and =-1. 
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8. PARTIAL DIFFERENTIAL EQUATIONS 



8.1 Introduction 



We have already solved, in earlier chapters, a few simple examples of partial differential equations (PDEs), 
namely: 

• Example 88 (i): fyx — has solution / (a;, y) — A{x) + B {y), 

• Example 88 (ii): fxx = has solution / {x, y) — C {y)x + D (y), 

• Example 103: xf^ = yfy has solution / (x, y) = E [xy) , 

where A, B, C, D, E are arbitrary difFerentiablc functions. 

And we've already noted that it's not surprising to see a fcth order partial differential equations having 
general solutions involving k arbitrary functions in the same way fcth order ordinary differential equations 
often involve k indeterminate constants; this is because differential operators like d/dx and d/dy eliminate all 
functions which all functions not involving their variable. As with ordinary differential equations though it is 
not always the case that k functions will be involved. 

In this first course in multivariate calculus we will not look to discuss more than a few important examples 
of PDEs and hardly touch on the problem of solving a PDE with boundary conditions (something for the Hilary 
term course). Here though we do give one method of finding some solutions to a particular PDE with boundary 
conditions. 

Example 107 Find all solutions of the form T {x, t) = A {x) B (t) to the heat/diffusion equation 

dT _ d'^T 

dt dx'^ 

where n is a constant, in the range 0<x<l,0^t and which satisfy the boundary conditions 

T{0,t)^T{l,t)^Ofora.llt;?0. (8.1) 

Solution. Solutions of the form A {x) B (t) are known as separable solutions and most solutions will not be 
of this form. Nevertheless they do have an important role in solving the equation generally (see next term's 
PDE course). If we substitute 

T{x,t)=A{x)B{t) 

into the heat equation we find 
and separating the variables we get 



A {x) B' (t) = kA" (x) B (t) 
A" {x) B' {t) 



A{x) ~ KB{t) ^^-^^ 

Let's call the quantity in equation (8.2) c{x,t). Note c{x,t) is both a function solely of x (from the LHS) and 
a function solely of t (from the RHS). It follows that for any {xi,ti) , {x2,t2) 

c(xi,ii) = c{xi,t2) [as c only depends on x] 
= c{x2,t2) [as c only depends on t]. 

That is, c is constant! We find the solutions in A (x) from (8.2) depend now on whether c is positive, negative 
or zero. We have the following three possibilities: 

Pi exp (v^x) + Qi exp (— v^a;) c> 0, 
A{x)= { P2X + Q2 c = 0, 

Ps cos (v^^ca;) + Q3 sin (v^^ca;) c < 0. 
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As we shall see only one of these cases can fit the boundary conditions A{0) — A (1) — 0, which follow from 
(8.1), in a non-zero fashion. 

• Case (i): c > 0. We have A (x) = Pi exp {\/cx) + Qi exp {—^/cx). The boundary conditions imply 

Hence Pi = Qi = and A(x) = is the only solution of this form to satisfy A{0) = A (1) = 0. 

• Case (ii): c = 0. We have A (x) = P2X + Q2- The boundary conditions imply that P2 = Q2 = and, 

again, A (x) = is the only solution of this form to satisfy A{0) ^ A (1) = 0. 

• Case (iii): c < 0. We have A {x) = P3 cos (-y^or) + Qs sin (^^/^x) . The boundary conditions imply that 

Ps = 0, Q3 sin (v^ = 0. 

If A (x) is to be a non-zero solution we need Q3 to be non-zero and hence sin (a/— c) = 0. This means 
c = — n^TT^ for some non-zero integer n. For this value of c we have 

A (x) = Q3 sin (mrx) , 

B (t) = exp (— Atn^TT^f) [up to a scalar multiple.] 
So we see there are non-zero separable solutions of the form 

T {x, t) = Kn sin {mrx) exp (— Kn^Tr^t) 
where n is a non-zero integer and Kn is real. 



Remark 108 Note that any sum of separable solutions will also solve the original heat equation, but generally 
this sum will not, itself, be a separable solution. So the separable solutions are far from being the only ones. 
However the separable solutions do have an important role in the general solution of the above boundary-value 
problem as the general solution can be written as an infinite sum of the separable solutions. See the Hilary 
Term courses on Partial Differential Equations and Fourier Series. 



8.2 Laplace's Equation 



In Cartesian co-ordinates Laplace's equation reads as 

^r-^ + = (in the plane); 

d^f d^f „ , . ^ 

~dx^ 'g^ ~ * dimensions). 

A function which satisfies Laplace's equation is known as harmonic. Laplace's equation is often abbreviated 
to 

vV = o 

and the differential operator is known as the Laplacian. 

Laplace's equation appears in many physical situations. For example, the following are harmonic: 

• the gravitational potential in a region containing no matter; 

• the electrostatic potential in a charge-free region; 
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• the steady-state temperature in a region containing no source of heat; 

• the velocity potential of an incompressible fluid with no vortices, sources or sinks. 

Harmonic functions also have an important place in pure mathematics, for example in the study of complex 
functions. 

Example 109 Let z = x + iy and let f (z) = z"". If we write u = Ref and v = lmf so that 

u{x,y) +iv{x,y) = {x + iy)"^ , (8.3) 
show that u {x, y) and v {x, y) are both harmonic functions. 

Solution. If we differentiate (8.3) twice with respect to x, and also twice with respect to y, we get 
Uxx + ivxx = n{n-l){x + iy)"'~'^ ; 

Uyy + ivyy = 71 (ti — l) (x + iy)^~'^ = " Ti {u — l) {x + iy)^~'^ . 
Hence, adding the previous two equations, 

i^^xx ~i~ '^yy) ^" ^ i^^^xx ^" '^yy^ — 
Comparing real and imaginary parts we see that u and v are both harmonic. ■ 

The above example extends more generally to differentiable complex functions — see Exercises 159 and 160. 

As before we can look to find important classes of solutions of Laplace equation by insisting that the solutions 
have a certain form. For example we can consider separable solutions (see Exercise 156), circularly symmetric 
solutions (Example 110 below) and spherically symmetric solutions (see Exercise 163). 

Example 110 Show that Laplace's equation in planar polar co-ordinates reads as 

d^f Idf 1 d^f 
dr^ r or r^ 86 

Hence find all circularly symmetric solutions to Laplace's equation in the plane — i.e. find all harmonic 
functions f (r) which depend only on r and are independent of 6. 

Solution. Firstly we need to know how Laplace's equation reads in polar co-ordinates. Recall that r = 
^Jx"^ -\-y'^ and 6 = tan~^ {v/x) so that 



Tx 



X [x' + y^y'l^ , rxx = {x' + y')-'^' - x' {x' + y^'^' = yVr^ 



0x = {-y/x'')/{l + yyx'')=-y/{x''+y^), 6xx = 2xy {x^ + y^)' 



Similarly 



: y{x'+ y')-'" , rxx = {x' + y'y'^' - y' {x' + yY'^' = x'/r^ 
Oy = {l/x)l{l + y^/x^)=x/{x^+y^), Oyy = -2xy {x-" + y^^ . 

So for any / defined on the plane 

fxx + fyy = fr {Vxx + ryy) + fe {Q xx + Qyy) + frr ijl + r^) + 2fr6 {r xB x + ryQy) + f^Q {fx + ^l) 

= fr (y' + x^) /r' + fe (0) + frr {x^ + y') /r' + 2fr9 (0) + fee {x^ + y') /r^ 
= fr/r + frr + fee/v^ 

Hence Laplace's equation, in terms of planar polar co-ordinates, reads as (8.4). We are interested in finding the 
circularly symmetric solutions, solutions which are independent of 6. In this case (8.4) now reads as 

— - H ^ = 0. 

dr^ r dr 
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This equation has integrating factor r and so we have 



giving 



and so the general solution is 



dr V dr / dr^ dr 



d/^ A 
dr r 

f {r) = Alnr + B 



One useful aspect of parabolic co-ordinates (see Example 93) is that Laplace's equation reads the same in 

both Cartesian and parabolic co-ordinates. 

Example 111 Recall the definition of parabolic planar co-ordinates 

y = uv. 



— 



2 ' 

Show that Laplace's equation 

transforms into the same equation in parabolic co-ordinates. 
Solution. Prom the second order chain rule we have 

= Fx+ U^FxX + lUVFxy + V^Fyy 

= -Fx-\- v^Fxx - 2uvFxy + u^Fyy . 

Hence 

and Laplace's equation reads the same in each co-ordinate system. ■ 

Example 112 Let A e M and n e Z. Show that the function g {x, y) = A sinh {wkx) sin {n-Ky) satisfies Laplace 's 
equation and the boundary conditions 

g {t, 0)=g {t, 1) = fort^O and g{0,t)=0 for < i < 1. 

By using parabolic co-ordinates, or otherwise, find a non-zero solution h{x,y) of Laplace's equation which 
satisfies the boundary conditions 

h{t,0)=0 for t^O, h{t,l/t)=0 fort^l, and h{t,t,) = for < t < 1. 

Solution. Firstly note 

gxx — n^ir'^A sinh (nnx) sin (nny) , and gyy = —n^n^A sinh (rnrx) sin (mry) 

and so g {x, y) is indeed harmonic. Further 

g {t, 0) ^ A sinh {mrt) sin (0) = 0, 

g {t, 1) = A sinh (riTrt) sin {mi) = 0. 

g (0, t) = A sinh (0) sin {mrt) = 0, 

as required. 
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If we combine the previous part (with A = n = 1) and the previous example we know that the function 

Mx,,) = si.l.-'-V''%^(„.) 
satisfies Laplace's equation and also is zero when 

=0 or xy = 1 or xy — 0, 

which, we see, include the given boundary conditions. ■ 

Poisson's equation is the inhomogeneous Laplace equation 

vV = g. 

It appears (amongst other places) in gravitational theory: Laplace's equation V^(/> = dictates how the 

gravitational potential (p behaves in the absence of any matter, but, more generally, when there is matter 
present,. distributed with density function p{x,y,z) , then Poisson's equation 

= -AttGp 

describes the potential's behaviour. As the Laplacian is linear then the only added complication in solving 
Poisson's equation — as compared with solving Laplace's equation — is in finding a particular solution 

Example 113 Find the circularly symmetric solutions of Poisson's equation in the plane V^/ — g where 



9{r) = 

Solution. Recall that the Laplacian is given by 



r for r < a 
for r > a 



in planar polar co-ordinates. We are only interested in circularly symmetric functions of the form / (r); so for 
r < a we need to solve 

/"(r) + r-V'(r-)=r. 

If we multiply both sides by r we get 

(r/'(r))' = r/" (r) + /' (r) = r^. 

Hence 

r/'(r) = rV3 + A^/'(r) = rV3 + Ar-i 
/(r) = r^/9 + yllnr + B. 

If we solve the equation similarly in the region r > a then we find 

/(r) = Clnr + £l. 

There are several things to note though. Firstly if this function is to be defined at r = it must be the case 
that j4 = 0. But also there are issues of continuity and smoothness at the r = a border. For our solution / to 
be continuous at r = a we need 

aV9 + -B = Clna + D 

and for it to be smooth at r = a (that is d//dr agrees on both sides of r = a) we need = Cja. Hence 

A = 0, C = aV3, £>= (aV9)(l-31na)+B. 
So the most general circularly symmetric solution of this Poisson equation is 



^ + B for r < a, 

^lnr+^(l-31na) + B for r ^ a. 



To specify the solution uniquely then we might, for example, require a further condition such as / (0) = which 
would give B = 0. ■ 
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8.3 The Wave Equation 



We will consider the vibrations of an clastic uniform string under certain simplifying assumptions:- 

• the string undergoes small vibrations: by this we mean that we shall consider second order terms, like 
and yl negUgible; 

• the vibrations are entirely transverse, so that a point at distance Xq along the string remains on the line 
X = a through out the motion; 

• the string is at tension T and the density of the string is uniformly p; 

• the effects of gravity and air resistance are negligible compared with the tension in the string. 

Consider the vibrations of a small section of the string from a; to a; + Ax. We shall denote the tension of the 
string at y {x, t) as Ti and the tension at y{x + Aa;, t) as T2, and likewise denote the angles the string makes 
with horizontal at those two point by ^1 and 62 as shown in the diagram below. 




If we resolve the forces in the a;-direction we get that 

T2 008 02 =Ticos6ii 

as the wave is vibrating transversely (i.e. this segment of string is only moving up and down), and taking 
components in the y-direction we get 

T2 sin 62 - Ti sin ^1 = (pAx) {xo , t) 

where xq Ih the .x-co-ordinate of the segment of mass, and because pAx is the segment's mass and yu its vertical 
acceleration. Note that xq will lie between x and x + Aa; as the entirety of the segment of string lies in this 
range. (Remember the weight of the string we consider negligible compared with the tension involved). 

If we divide the previous equation by the constant Ti cos^?i = T2 cos 62 which, to the level of second order 
approximation we are making, equals the tension T, we get (with a little rearranging) 

tan 02 — tan 01 p d^y . . 
A^ = TSi^(^°'')- 



Now 



dy 

tan 6*1 = — {x,t) , 



tan 09 



dx 



(x + Aa;, t) 



and so by the Mean- Value Theorem (see next term's Analysis) it follows that 
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for some cG {x,x + Ax) . Now letting Ax tend to 0, and assuming continuity of yxx and ytt we get 

Note that p/T has units {kg / m) / {kgms~'^) = {s/mf so that c = \/T/ p has the units of velocity. The wave- 
equation then reads 

Theorem 114 (D'Alembert 1746) The general solution of the wave equation 

d^y _ ^ d'^y 
dt"^ dx^ 

is 

y {x, t) = f {x - ct) + g{x + ct) . 
Proposition 115 Proof. We introduce two new variables 

( = X — ct and r] = x + ct. 

Then by Theorem 105 we have 



Vxx = ycCxx + Vv^xx + ya iCx? + '^ycvCxVx + ym (Vxf 

X + X - 

ya + 2yc») + yw 



= yt^xO + y^xO + y^^x {if + 2y^r, x 1 x 1 + x (1)^ 



and 



ytt = ycCtt + yriVtt + ycciCtf + ^ycr,<tVt + yririi%f 



ycxO + yr,xO + y^^x {-cf + 2y^^ x (-c) x c + x (c)' 

{ya - 2yc^ + ym) 

<? {ya + 2^^? + yvv) = ^ (ya - '^ycv + ym) 



Hence c^y^x = ytt if 



so that 

ycv = 0- 

We know, from Example 88 (i), that the general solution of this is 

y = f{0+g{ri) = f{x-ct)+g{x + ct). 



• Consider a solution of the form y (.-r, t) = f {x — ct), with y = 0. Note that this solution at time t + T 
resembles the solution at time t, but translated to the right by cT. This is a wave moving to the right 
with speed c and likewise a wave of the form g{x + ct) is one which is moving to the left at speed c. 

Example 116 Find the solution to the wave equation when 



y^""'^' 1 otherwise ' dt ^ ' ' 



Sketch your solutions at ct = 0, ^,1,^. 
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Solution. We know that the solution has the form y [x, t) — f {x — ct) + g {x + ct) and so the initial conditions 
give that 



f{x)+9{x) 



-cf'{x) + cg'ix) = 0. 



1 - \x\ -1 < x < 1, 
otherwise, 



Integrating the second equation we see that f (x) = g {x) + K where K is a, constant. We can, without any 
loss of generality take K to be 0. [Keeping K in the calculation would yield to the same solution though with 
slightly different / and g.] So f = g and we have 



y{x,t)^ f{x-ct) + f{x + ct) 



where 



/(«) 







otherwise. 



Sketches of the solution at different t are given below. 



-4 -3 -S -1 




■0.5 

Ct = 0.5 



i.s 

1.Z5 



-■4 -S -& -1 



-JO.S' 
Ct 
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8.4 Exercises 



Exercise 151 Show that the function f {x,y) = tan ^ {y/x) satisfies Laplace's equation in the plane. Find 
functions g and h such that f can be written in the form 

f {x, y) ^ g{x + iy)+h{x- iy) . 

Exercise 152 By making the substitutions rj ^ x — y and ( = x — 2y show that the general solution of 

"1" ^fxy ~t~ fyy ~ 

is f {x,y) = A {x — y) + B {x — 2y) where A and B are arbitrary differentiable functions of one variable. 

Exercise 153 Let k be an integer. Show that r*' cos r*' sin A:^ satisfy Laplace's equation in planar polar co- 
ordinates (See Equation (8.4)). Why is the requirement made that k is an integer? 
Hence find a solution to Laplace 's equation which satisfies the boundary condition 

fil,e) = 008^6' + sin 61. 

Exercise 154 Write the solutions r^ cos k9 and r'^ sin k9 in the previous exercise in the form g{x + iy) + 

h {x - iy) . 

Exercise 155 Either directly, or by using polar co-ordinates, show that if f {x, y) is harmonic, then so is 
f {x/r'^, y/r^) where r"^ = x^ + y^ . 

Exercise 156 Show that separable solutions f {x,y) — X {x)Y {y) of Laplace's equation in Cartesian co- 
ordinates take one of the forms 

X{x) = Ax + B, Y{y)=Cy + D, 

X {x) = Aexp (\/ca;) + Bexp (— vex) , F (y) = Coos (vcy) + D sin (-/cy) where c> 0, 
X (x) = Acos (-\/ca;) + .Bsin (— -^/ca;) , F (y) = Cexp (-\/cy) + £>exp (— where c> 0. 

Exercise 157 Show that separable solutions f{r,9) = R{r)@{6) to Laplace's equation in planar polar co- 
ordinates must satisfy the equations 

r'^R" + rR' -cR = 0, (8.5) 

e" + ce = 0, 

for some constant c. 

By making the substitution r = e* and setting T (t) = R{r) = i?(e*) into equation (8.5) show that 

T {t) = Ae*^' + Be"*^* if c = k"^ > 0, 

T{t) = At + B ifc = 0, 

T{t) = Asm {kt) + B cos (kt) i/c=-fc^<0. 

Hence find all solutions of Laplace's equation which are separable in polar co-ordinates. 

Exercise 158 Let x £R and t > 0. Verify that 

T(.,t) = ^exp(^) 

is a solution of the heat equation ^ = Sketch T as a function of x at two different times t. Show that 

/oo 
T{x,t) dx 
-00 

is a constant function oft. 
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Exercise 159 * Show that the real and imaginary parts of the following functions are harmonic: 

11 X u 

_ ^x+iy _ e^cosx + ie^sin?/; - = = ^ - i—^ — o 

z x + iy a;^ + y"' a;^ + 

Exercise 160 * Let / : C ^ C be a complex-valued function with real and imaginary parts u and v so that 

f{x + iy)^u {x, y) + iv {x, y) . 
f is said to he differentiable at z G C if 

f'iz) = lnnli^±^^l^ 



exists as /i — > inC 

(i) Let /i = a; e M. Show that 



^^ fi^ + h)-f{z) _du ^ .dv_ 
h-yo h dx dx' 



(a) Let h = iy where y gW. Show that 

f(z + h)-f(z) dv .du 

lim — — ^——^ = I — . 

h^o h dy oy 

(Hi) Deduce the Cauchy-Riemann equations which say that Ux = Vy and Vx = —Uy. Hence show that u 
and V are harmonic functions. 

Exercise 161 The co-ordinates u and v are defined by 

X = cosh u cos V, y = sinh u sin v 
where x and y are Cartesian co-ordinates. Show that 

dx^ %2 sm"^ V -\- sinh^ u \du^ dv"^ J ' 

Exercise 162 (i) Given spherical polar co-ordinates r,6,(f) show that 

Br = (sin 6 cos (p, sin 9 sin (j), cos 6) . 

e^ = (— sin (j), cos </>) 

Be = (cos 9 cos cos 9 sin cp, — sin 9) 

are mutually perpendicular unit vectors. 

(a) Show that if r (t) is a vector with co-ordinates r{t),4> (t) , 6 (t) at time t then 

r' (t) = fBr-\-(j) (r sin^e^) + 9 (ree) . 

Exercise 163 Show that the Laplacian V^/ equals 

d^f 25/ cot^a/ 1 ay 

rd^ ~i^d9 ^ r'^sm9d^' 

Hence find the spherically symmetric solutions of Laplace 's equation in three dimensions. 

Exercise 164 * Let xi,X2 be co-ordinates describing two-dimensional space (e.g. Cartesian or polar). 

(i) Explain why there exist functions /ii,/i2 of x\,X2 and unit vectors b\,B2 such that for any path r(i) in 
the plane we have 

r' (t) = /iiiiei + h2X2e2- 

(a) If the vectors 61,62 are orthogonal then xi,X2 are said to be orthogonal curvilinear co-ordinates. 
Show that Cartesian, polar and parabolic co-ordinates are all orthogonal curvilinear and find /ii,/i2jei,e2 in 
each case 
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Exercise 165 * Show that, in terms of orthogonal curvilinear co-ordinates X\,X2 in the plane 



dxi \hidx\) dx2 \h2dx2 



Verify that this formula agrees with (8.4) in the case of planar polar co-ordinates. 

Exercise 166 * Let L > 0. Find all the separable solutions 

y{x,t)=X{x)T{t) (8.6) 

of the wave equation yu = (?yxx which satisfies the boundary conditions 

y (0, t)=y (L, = for all t. 

Exercise 167 * A string with small vertical displacement y {x, t) , where a ^ x ^ b, is fixed at each end so that 
y{a,t) = = y (6, t) for all time t. Show that the string's energy 



is constant throughout the motion. ( The first term of the energy formula is the tensile energy and the second 
term represents the string's kinetic energy.) 

Exercise 168 * Recall that the general solution of the wave equation yu — c^yxx has the form y {x, t) = 
f {x + ct) + g{x — ct). If further we impose the initial conditions 

y {x, 0) ^ h {x) , yt {x, 0) = v (x) 

show that 



h{x + ct) + h{x-ct) 1 



y{x,t) ' ^"^ ^ — — + ^J ^v{s)ds. 



Exercise 169 * A stretched string is fixed at two end-poi/ids x = and x = I so that y {0,t) = = y {I, t) for 
all times t. Investigate the natural modes of vibration of the string by seeking solutions to the wave equation of 
the form 

y {x, t) = f (x) sin ujt. 

Show that, with the given boundary conditions, such vibrations can occur only at certain natural frequencies 

nc 

- = -, n=l,2,3,... 

Show that the higher the value of n, the larger the number of nodes ( a node is a point of zero displacement on 
the string throughout its motion). 
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9. JACOBIANS AND AREA 



9.1 A Brief Introduction to Double Integrals 



In a moment we will meet the definition of a Jacobian. The definition is simple enough, but, in order to motivate 
the idea behind it, we will need to calculate some areas. As we change from one co-ordinate system to another 
the equations of curves change, the co-ordinates of points, the rules for functions. Likewise the Jacobian is a rule 
for how measuring areas, or volumes, changes with a co-ordinate change. Our first two examples of calculating 

areas involves two examples of double integrals. 

Example 117 Calculate the area of the triangle with vertices (0,0) ,{B,0) and {X,H). 
Solution. 







CX,H) 


























i 




p 


b 



We have known, since early school days that the answer is \BH, but we shall demonstrate this here by 
means of a double integral. The three bounding lines of the triangles are 



H 



H 



y 



X-B 



{x-B). 



We'll assume here that Q < X < B . In order to "pick up" all of the triangle's area we need to let x range from 
to -B and y range appropriately from up to the bounding line above (x, 0) . Note that the equation for this 
bounding line changes eX x ^ X which is why we need the split the integral in two in the calculation below. As 
our X and y vary over the triangle we need to pick up an infinitesimal piece of area da; dy at each point. We 
can then calculate the triangle's area as 



l-x=X i-y=Hx/X ,-x=B py=H(x-B)/(X-B) 

/ / dyda:+ / / 

lx=0 Jy=0 J x=X ■Jy=0 

r''^dx+ r^' Hjx-B) 

l^—n X .l^—y X — B 



dy da; 



dx 



H 
X 



X 



+ 



H 



X-B 



{x-Bf 



B 



J X 



H (X-B)' 



HX^ 

X^ X-B 2 
HX H{X-B) _ HB 
~^ 2 ~ 



Integrating this way, y first then x, we would need to treat X < and X > B as two further cases. 
Alternatively, one could also pick up the area by first letting y range from to H and letting x range over the 
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interior points of the triangle at height y 



. i 




x= (X-B) Y/H+B 



This method is somewhat better as we don't have to treat the three cases ofX<0, 0<X<B and B < X 
separately. 



^y=H i.x=y(X-B)/H+B 



Jy=0 Jx=Xy/H 

-"^ ({X-B)y 



- f 

Jy=0 

- r 

Jy=o 



H 



+ B- 



da;dy 



B 



B H 



y 

2H 

_ IP 
2H 



y=H 



BH 



H 



dy 



Example 118 Calculate the area of the disc + < a^. 

Solution. Again we know the answer, namely na"^. If we wish to pick up all of the disc's area we can let x vary 
over the range —a to a and, at each x, we need to let y vary from — — x'^ to \/a^ — x^. So we have 



f-x—a py=\/a^ — x^ 
J x=—a J y=—y/a'^—x'^ 

= / 2\/ — x^ dx 

J x——a 



dydx 



t/2 



/TT/ 

/Try 



7r/2 
7r/2 



2\/a^ — sin^ 9 a cos 9 d6 [a; = asin^] 



2cos^6id6i 



/2 
7r/2 

/2 



1 + cos 29 d(9 



I + - sin 261 



it/2 
-n/2 



-KC? . 



In the first example of the triangle using y as the outside variable and x the inside helped avoid making special 
cases. For this example of the disc though it would have been much more natural to use polar co-ordinates — 
if we knew how to calculate areas with such! 
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9.2 Jacobians 



You may be aware that the modulus of a determinant of a 2 x 2 matrix A is a measure of how much the 
map X I — > Ax stretches area. In this case the map does so uniformly at each point. The Jacobian, or rather 
its modulus, is a measure of how a map stretches space locally, near a particular point, when this stretching 
effect varies from point to point. The Jacobian takes its name from the German mathematician Carl Jacobi 
(1804-1851). 

Definition 119 Given two co-ordinates u{x,y) and v {x,y) which depend on variables x and y, we define the 
Jacobian 

d{u,v) 



to be the determinant 



det 



d{x,y) 



du du 

dx dy 

dv dv 

dx dy 



Example 120 Let u and v be related by the equation 



a b 
c d 



Then 



d{u,v) 
d {x, y) 



= det 



a b 
c d 



ax + by 
cx + dy 



= ad — be. 



(9.1) 



Consider the parallelogram that is the image under the above map (9.1) of the square with vertices (0,0) , (1,0) , 
(1, 1) , (0, 1) . This should have area \ad — bc\. 



( a+hi , c +d) 




The area of a parallelogram with sides a and b is |a A b| Hence the above parallelogram's area is 

|(ai + cj) A {bi + (ij)| = |adi A j + 6cj A i| = [(ad - be) k| = \ad - bc\ . 
More generally any region with area A will transform under the map (9.1) to a region with area \ad— bc\ x A. 
Example 121 Let x = rcosO and y = rsm6 where r and 6 are polar co-ordinates. Then 

d{x,y) 



d{r,e) 



det 



det 



dx Ox 
Sr 80 



dr 09 

COS 9 —rsinO 
sin 6 r cos 9 

■2 a , „:„2, 



= r (cos^ 9 + sin^ 9) = r. 
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Example 122 In reverse, r = + and 6 = tan ^ {y/x) and 

/ dr dr \ 



d (r, 6>) 
d{x,y) 



y dx dy 



= det ( V^'+y^ \/^^+y^ 



+ y 



,2 



(a;2+y2)3/2 

1 1 



y 



The Jacobian satisfies its own sort of diain rule as one might expect, given that the effect on area of two 
scalirigs would be the product of the scalirigs. 

Proposition 123 Let r and s be functions of variables u and v which in turn are functions of x and y. Then 

d (r, s) d (r, s) d (u, v) 



d{x,y) d{u,v) d{x,y) 



Proof. 

d (r, s) 



d {x, y) 



( dr dr \ 
t i 

dx dy J 

( dr du I 9r &u 9r du < dr dv 

du dx ' dv dx du dy ' dv dy 

ds du I 9s dv d£ du i 9s dv 

du dx ' dv dx du dy ' dv dy 



Corollary 124 



{/ dr_ ar \ / 9h du 

\ du dv / y dx dy 

d (r, s) d {u, v) 
d{u,v) d{x,y)' 



d{x,y)d{u,v) ^ ^ 
d {u, v) d {x, y) 



Solution. Take r = x and s = y in the previous proposition. Indeed the stronger result holds true that 

dx 



dy 
du 




Example 125 Recall the definition of parabolic planar co-ordinates 

1 



a; = - (u^ — v^) , y = uv. 



In this case the Jacobian is given by 

d{x,y) 
d {u, v) 



= det 



= det 



dx dx 
du dv 
oy dy 
du dv 

U —V 

V U 



2 , 2 
U + V . 
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Though it is somewhat messy to calculate u and v in terms of x and y we can easily calculate that 

= 4a;^ + 

d{u,v) _ 1 _ 1 

d {x, y) +v'^ 4 (a;^ + y^) ' 

Example 126 Calculate Ux,Uy,Vx,Vy in terms of u andv where 

X = a cosh u cos v, y = a sinh u sin t;. 



Solution. We have 



Hence 



1 


Xy 




\ Vu 


Vv , 





asinhucosu — acoshusint; 
a cosh u sin u a sinh u cos v 



( Ux 


Uy \ 


- ( 


V Vx 


Vy ) 





a sinh u cos v —a cosh u sin u 
a cosh w sin u a sinh u cos v 

1 



sinh u cos v cosh u sin v 



a(sinh^ucos2u + cosh^wsin2u) V - cosh u sin i; sinh u cos u 
■ 

Though 3x3 determinants are off this syllabus, (see Hilary term Linear Algebra) included here are the 
Jacobians of spherical and cylindrical polar co-ordinates. 

Example 127 (Cylindrical Polar Co-ordinates) 

X = r cos 6, 



y = r sin^, 



Then 



d{x,y,z) 
d{r,e,z) 



dx 
dr 
ay 



dx 



dx 



oy 

dz_ oz_ 

dr 80 dr 

COS 9 — rsin^ 

sin 6 r cos 6 
1 

cosO —rsinO 
sin 9 r cos 9 



Example 128 (Spherical Polar Co-ordinates) 

X = rsm9 cos (j), 



Then 



d(x,y,z) 
d{r,cl>,9) 



dx 


dx 


dx 


dr 


d4> 


de 


dy_ 


dy 


dy 


dr 


d<t> 


de 


dz 


dz 


dz 


dr 


dip 


de 



y = rsm9sm(f), z = rcos9. 



sin ^ cos —rsm9sm4> r cos 9 cos (j) 

sin 9 sin (j) r sin 9 cos (p r cos 9 sin (j) 
cos 9 sin 9 



-r sin ^ sin ^ r cos 9 cos cj) 
r sin 9 cos ^ r cos 9 sin ^ 



cos 9 

2 

-r^ sin 9 (cos^ 9 + sin^ 9) 
— sint 



r cosfsmfcos 



sin^ 6 — cos^ ( 



— r sin ^ 



— r sin 



sm y cos 
sin 9 sin 

2 X 



—r sin 9 sin <p 
r sin 9 cos <^ 



9{ 



cos 



+ sin^ 
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Definition 129 Let RCM?. Then we define the area of R to be 

A{R) = j j Axdy. 

{x,y)eR 

Quite what this definition formally means, (given that integrals will not be formally defined until Trinity 
term analysis), and for what regions of R it makes sense to talk of their "area", is actually a very complicated 
question. But for the simple examples we shall give this definition will be clear and their area unambiguous. 



Theorem 130 Let f : R 

that 



S be a bijection between two regions of MP, and write {u,v) 
d {u, v) 



f ix,y) ■ Suppose 



9 {x, y) 



is defined and non-zero everywhere. Then 

A{S) = 



du dv 



iu,v)es 



{x,y)eR 



d {u, v) 



d{x,y) 



dx dy 



Example 131 // we return to the case of calculating the area of the disc x-^ + y"^ < a^, now using polar co- 
ordinates we have a much simpler double integral. The interior of the disc, in polar co-ordinates, is given 
by 

O^r^a, O^e <2tt. 



So 



1 1 '"'^=£o" 





d{x,y) 


'e=o 


d{r,9) 



d0dr 



tr—a 
Jr=0 



r=0 J8=0 
r—a 



[^'^0=0 



rdO dr 

dr 



pr=o 

■ / 

Jr=0 



2iT I rdr 

•=c 

„2 " 



= 27r 



= na 



r=0 



Proof. (Sketch proof of preceding theorem) Consider the small element of area that is bounded by the co- 
ordinate lines u = U() and u = Uq + Su and v = vq and v = vq + Sv. Let's say that / (xq, j/o) = ("''Oj^^o) and 
consider small changes 6x and 6y in x and y respectively. We have a slightly distorted parallelogram with sides 

a = f {xQ + 5x,yo) - f ixQ,yo) ^ ^{xo,yo) Sx, 



f{xo,yo + 5y) - f {xo,yo) 



9/ 
dy 



{xo,yo) 5y, 



ignoring higher order terms in Sx and Sy. As / takes vahics in then the above are vectors in R^. The area 
of a parallelogram in with sides a and b is |aA b| where A denotes the vector product. So the element of 
area we are considering is (ignoring higher order terms) 



df , df ^ 
ox ay 



Now / = {u,v), so /a, = {u^,v^), fy = {uy,Vy) B^iid 



dx dy 



6x dy. 



dx dy 



du dv\ / du dv 
dx' dx J \dy' dy 



du dv du dv 
dx dy dy dx 



k. 
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Finally 



5A 



du dv du dv 
dx dy dy dx 

6xSy. 



6x Sy. 



d{x,y) 



Example 132 A shear parallel to the x-axis is a map of the form 

{x,y) I — > (x + ky,y). 

Note that this can also be written as 



1 k 
1 



and that the determinant of the above matrix is 1, so that shears are area-preserving. 
If we consider the effect of the shear 

' 1 -X/H 
1 

on the co-ordinates (0,0) , {X,H) , {B,0) of the vertices of the triangle in Example 117 we see that they are 
moved to (0, 0) , (0, H),{B, 0) . So we see that we can calculate the general triangle's area by just considering the 
special case when X = Q. 

Indeed, if we were so inclined, we could note that the stretch 

1/B 
l/H 

has determinant 1/ (BH) and moves the triangle to (0, 0) , (1, 0) , (0, 1). Hence the original triangle has area 
BH times this simpler triangle. 



Example 133 Calculate the area bounded by the curves 

2x = l-y^, 2x^y^-l, 8x^l6-y^, 
as shown in the diagram below. 



8x^y^ 



16. 




Solution. We see, if we change to planar polar co-ordinates x = (v? — v'^) /2, y = uv, that the region in 
question is 1 ^ u ^ 2, 1 ^ v ^ 2. Hence the area is given by 

fU=2 ,v=2 ^(^ 

dv du 

f u—l Jv — 1 
ru=2 rv=2 



A 



u—l Jv—1 
u=2 



u=l 
u=2 



d {u, v) 
{u^ + w^) du du 



U V + 



du 



11=1 



I 1 du 



u^ 


7u 


2 


7 


7 


14 


3 




1 


= 3^ 


3 


3 
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Example 134 The area of the sector bounded by the half-lines 9 — a and 9 — 13 and the polar curve r = R{9) 
equals 



A 



r drd0 



0=a Jr=0 



d9 



1 r^=0 



R{9Y <19 

Example 135 Calculate the area bounded by the double folium (see diagram) which has Cartesian equation 
[x^ + y^) = Aaxy'^. 




Solution. If we convert to polar co-ordinates this becomes 

r ~ Aa cos 9 sin^ 6 
and so the area equals bounded by the two leaves equals 

1 r^'^ 

A ^ - lea^ cos^ 61 sin^ 61 d6' 

2 J-tt/2 



16a^ 



7r/2 



sin'' 61 -sin'' 61 d6' 



16a2 (/4 - /g) 



where J„ = J^^^ sin" 9 d9. Now 

In+2 = 



r/2 



sin"+^6'sin6'd6' 



i"+'9cos9]l^^ 



it/2 



{n + 1) sin"- 9 cos^ 9 d9 



= (n + 1) (/„ - In+2) ■ 

Hence In+2 = ('^ + 1) In/ {n + 2) and so 



Finally then 



A ^ IQa^ {h - h) = 16a2 x 



16 " "96" 



= TraM 3 - 



96 

15 \ 7ra2 
"6 
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Example 136 To calculate the volume of the sphere x'^ + y'^ + ^ , it seems most natural to use spherical 
polar co-ordinates. The interior of the sphere, in spherical polar co-ordinates, is given by 



So 



A = 



pr—a />0— 27r pO—TT 



dx dy dz — 



r=0 J(p=0 



=0 



d{x,y,z) 



d{r,(b,0) 



dOdcpdr 



r=0 Jtp=0 J0=O 
31 r=a 



r=0 



0=0 



r sin 6 d9 d4> dr 



a'^ 47ra 
— X 27r X 2 = — 
3 3 



3 



Example 137 * Equal Area Projections 

Recall the definition of spherical polar co-ordinates 



X — r sm f cos ( 



which has Jacobian 



y — r sin 6 sin (f), 
d{x,y,z) 



z = r cos t 



d{r,^,9) 



r^ sin ( 



where ^ — 9 is latitude and cj) is longitude. If then we are considering the element of area on the planet's surface 
r = a, represented by 9 and ip, then the local area distortion is sin 6'. 



The Albers Equal-area Conic Projection is an equal-area projection. In terms of 9 and (f> the projection is 

X = —VC— 2n cos 9 sin n(j), U — Pn VC* — 2n cos 9 cos ncj), 

n n 

where n, C, Pq are constants. Then the Jacobian of the projection is 

■■ sin n(f> 



d{x,y) 




det ^ 









\/C — 2n cos 9 cos ncj) 
\JC — 2n cos 9 sin n<f) 

cos^ ncj) sin 9 + sin'^ n(f> sin 9 
sin 61, 



VC-2iicos6l 

2n sin B 

VC-2ncose 



cos ncj) 



SO that equal areas on the map represent equal areas of the planet. 
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9.3 Exercises 



Exercise 170 Let u{x,y) = xy and v{x,y) = y/x. Calculate the Jacobian d{u,v) /d{x,y). Show that the 
area in the first quadrant {{x,y) : x > 0,y > 0} bounded by the curves 

y = 2x, 2y = x, xy = 2, xy = 8, 

equals 6 In 2. 

Exercise 171 Calculate the Jacobians and , and verify that d^(^'v] = ^ in eo^ch of the following 

cases: (i) u = x + y, v = y/x; (ii) u = x'^/y, v = y^/x. 

Exercise 172 The variables u and v are given by 

u = x"^ — xy, V = y'^ + xy, 

for all real x,y. Find the partial derivatives Ux,Uy,Xu,yu in terms of x and y stating the values of x and y for 
which your results are valid. 

Exercise 173 Given that u, v, w are functions of the independent variables x, y, z for which 

u + V + w — X, vw + wu + UV = y, uvw = z, 
find the partial derivative du/dx in terms of u,v,w only. 

Exercise 174 * Given orthogonal curvilinear co-ordinates as defined in Exercise 164 show that 

d {x, y) 



d{xi,X2) 



= /li/l2- 



Exercise 175 Show that the area of the region bounded by circular arcs r = R and r = R + 5R and the rays 
= a and 6 = a + 6a equals 

RSRSa 

ignoring third order terms in 6a and 6R. 
Exercise 176 Evaluate 

/ ' 

In each case sketch the area of integration. 
Exercise 177 * Prove that the double integrals 

"1 / /"l ',-2 _ „2 \ /-l / /"l ,-'2 _ ,,,2 



/ / dydx, / / da;dy, / / dydx, / / 
Jo Jo Jo Jv/2 Jo Jo Jo Jo 



dxdy. 



Jo (i wrh'"') Jo Uo #t?f'") 

exist and evaluate them. 

Exercise 178 Calculate directly the area of the quadrilateral with vertices (1, 3) , (3, 3) , (2, 6) , (6, 6). Perform 
this calculation with both possible orders of Cartesian co-ordinates. 

Exercise 179 Let a> 0. The curve with polar equation 

r = a{l+cose), 0<6'<27r, 

is called a cardioid. Sketch the curve and show that the area bounded by it equals '^ira^. 
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Exercise 180 Let a> 0. Sketch the curve 



,2/3^^2/3^^2/3 



in the quadrant x,y > 0. 

The variables u and v are given in terms of x, y by 

x = ucos'^v, y = usm'^v. 

What is the equation of the curve in terms of the new co-ordinates u and v? Calculate the Jacobian g^^'^j and 
hence find the area of the region bounded by the curve and the positive x- and y-axes. 

Exercise 181 By considering an appropriate stretch find the area of the ellipse 

^ + ^ = 1 
a2 ^ 62 



Likewise, find the volume of the ellipsoid 



Exercise 182 Calculate the double integral 



t/2 ^2 
1- — H = 1. 

Tf 2 ^2 /^2 



/> oo /-■ oo 
-OO J — (. 



-oo J — oo 



via a change to polar co-ordinates and deduce that 

/oo 
e~^^ ds = ^/7^. 
-oo 

Exercise 183 * The mean value of a function f defined on a region V is given by the formula 

Find the mean value of the function xyz{x^ +y'^ + z^) in the region V given by 

V = {{x, y, z) G M.^ : x'^ + y"^ < 1,0 < X < y, -1 < z < I}. 
How would you define the median of a function f defined onV? 
Exercise 184 * Evaluate the integral 

J J {x^ + y^f dxdy 

A 

where A is the region in the positive quadrant enclosed by the curves 

x^ — 3xy^ = a, x^ — 3xy^ = b, Sx^y — y^ = c, 3x^y — y^ = d 
where < a <b and < c < d. 

Exercise 185 * Find, using polar co-ordinates the centre of mass of a uniform quadrant of a circle of radius 
a. 

Exercise 186 * Find the volume of the region satisfying the inequalities 

a:^ +2^ < a^, h ^ z ^ a. 
Hence find the height of the centre of mass of a uniform solid hemisphere of radius a. 
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Exercise 187 * Let b > a > 1. Consider integrating the function e over the range (0,oo) x (a, 6) in two 
different ways; assuming the order of integration does not matter show that 



f 

Jo 



- ax — log 



Exercise 188 * The Bessel function Jo {x) can be defined for a; € R as 



2 f ' 

Jo {x) = — I cos {x cos 6) AO. 
Ti" Jo 



Assuming the order of integration does not matter, show that for p > 0, 

/ Jo {x) e-f^ da; = 
Jo 
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10. THE GRADIENT VECTOR 



10.1 Standard Curves and Surfaces 



In the next section we will meet the gradient vector V/ of a scalar function /. The gradient vector is a way 
of calculating the normal of a curve, or surface, / = c (where c is a constant) defined by a function. Here we 
introduce a few standard examples of some curves and surfaces 

Example 138 (Conies) The standard form of equation of the conies (circle, ellipse, parabola, hyperbola) are 

• circle: + = a^; parametrisation (a cos 0, a sin 0); area vra^. 

• ellipse: jo} + jb"^ ib < a); parametrisation (a cos 6, bsinO); eccentricity = 1 — b"^ /a^ ; foci (±ae, 0); 
directrices x — -iia/e; area nab. 

• parabola: — Aax; parametrisation {cdP' , 2at^ ; eccentricity e — 1; focus (a, 0); directrix x — —a. 

• hyperbola: x'^/a^ — jlp' = 1; parametrisation (asect, 6tani) or (zbacosht, bsinhf); eccentricity ~ 
1 + b'^ /a^; foci (±ae,0); directrices x — zLa/e; asymptotes y — ibx/a 




a 

ellipse hyperbola 

Example 139 Find the point of the ellipse x^ /A + y^/9 = 1 which is closest to the point (5/\/2, 5/\/2) . 

Solution. In order to solve this we can turn the question into a one-variable maximisation problem by para- 
metrises the ellipse as (2 cost, 3 sin The distance D [t) of this point from (5/a/2, 5/a/2) is given by 



D{tf 



V2 



2 cost 



V2 



3sint 



If we differentiate this we get 



2D^ = 2 - 2cost^ (2sint) +2 - Ssint ) (-3cost) = 0. 



THE GRADIENT VECTOR 



103 



This rearranges to 



\pi sin t — cos t ( —= — sin t 
.V2 



By inspection we see that when t ~ -k/A and sini — cost = l/\/2 the equation holds and the closest point is 

2cos-,3sm-j = (^V2,-^ 



Example 140 (Examples of Quadrics) The standard form of equation of the quadrics are 

• Sphere: + + ~ ; 

• Ellipsoid: x^ja^ + j}? + z^/c^; 

• Hyperboloid of One Sheet: x'^/a'^ +y'^ /b'^ — jc? = 1; 

• Hyperboloid of Two Sheets: x^/a"^ — — /c^ = 1; 

• Paraboloid: z ~ x"^ + y"^ ; 

• Hyperbolic Paraboloid: z = a;^ y^; 

• Cone: x'^ + y'^ — z^ . 





one sheet hyperboloid hyperbolic paraboloid 

Example 141 Find the tangent and normal to the ellipse x^ jcP' +y'^/b'^ = 1 at the point {X,Y) 
Solution. One way would be to implicitly differentiate the equation and obtain 

2x 2y dy 



b^ dx 



= 



giving dy/dx ~ —b'^X/ (a^F) at the point in question. Then the normal has gradient a'^Y/ And so the 

two equations are 



y-Y 

y-Y - 



-b^X 



a'^Y 
a^Y 



(x — X) (tangent) 



b'^X 



{x — X) (normal) 



The tangent's equation simplifies somewhat to 

h^Xx + a^Yy = a^b^ . 
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Alternatively we know that a parametrisation of the ellipse is given by 

r [t) = (a cos t, b sin t) . 
So a tangent vector to the ellipse at r (t) equals 

r' (t) = {—a sin t, b cos t) . 
This is a vector in the tangent direction. Hence a vector in the normal direction is 



. , fbX aY\ ba f2X 2Y\ 
(6cosMsint) = (^-, - j = - (^^, j 



Quite why the last term has been written that way will become clearer once we have met the gradient vector. ■ 

We probably all feel we know a smooth surface in when we see one, and this instinct for what a surface is 
will be satisfactory for the purposes of this course. We recognise all the previous examples as smooth surfaces, 
with the exception of the cone at (0, 0, 0) and could well be comfortable working out their tangent planes and 
normals. In the main we will be happy to work with surfaces without a formal definition, but for those seeking 
a more rigorous treatment of the topic, the following might provide a suitable working definition. 

Definition 142 * A smooth parametrised surface is a map r, known as the parametrisation 

r : U -^M^ : {u,v) {x {u, v) ,y{u,v),z {u, v)) 
from an open subset U CM? to such that 

• x,y,z have continuous partial derivatives with respect to u and v of all orders 

• r is a bijection with both r and being continuous 

• at each point the vectors 

dr dr 

— and — 
ou ov 

are linearly independent (i.e. are not scalar multiples of one another). 

We will not be looking to treat this definition with any generality. Rather we shall just look to parametrise 
some of the "standard" surfaces previously described and calculate some tangents and normals. We define: 

Definition 143 Let r : U be a smooth parametrised surface and let p be a point on the surface. The 

plane containing p and which is parallel to the vectors 

dr dr 
-(p) and -(p) 

is called the tangent plane to r (U) at p. Because these vectors are independent the tangent plane is well- 
defined. 



Definition 144 Any vector in the direction 



dr dr 



is said to be normal to the surface at p. There are two unit normals of length one. 

Example 145 We have already met a parametrisation for the sphere + + 2;^ = with spherical polar 
co-ordinates. This given by 

r {<p, 9) = {a sin 9 cos 0, a sin 9 sin cj), a cos 9) . 
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We already know the outward-pointing unit normal at r {0, (j)) is r {0, (j>) /a but let's verify this with the previous 
definitions and find the tangent plane. We have 



{—a sin 6 sin ^, a sin 6 cos (j>, 0) , 

(a cos 6 cos a cos 9 sin </>, — a sin 6) . 



Hence 







i j 




k 




— asinffsini/) asin^ 


coscp 







a cos 


cos 4> a cos 


sint/) 


— asin( 






i 


j 


k 


= a sm9 


— sint/) 


COS0 






dr 

dr 
86 



dr dr 

d4>^de 



COS COS (f) COS sin (j) — sin 

(— sin cos ^ 
— sin 6 sin ^ 
— cos^ 

and so the two unit normals are ± (sin 9 cos </>, sin 9 sin 0, cos ^) . The tangent plane at r {(p, 9) is then 

r- (sin 9 cos (j), sin 9 sin 0, cos 9) = a. 

Example 146 Find the normal and tangent plane to the point {X, Y, Z) on the hyperbolic paraboloid z = x^—y^. 

Solution. This surface has a simple choice of parametrisation as there is exactly one point lying above, or 
below, the point {x, y, 0) . So we can take a parametrisation 



We then have 



So 



r{x,y) = {x,y,x'^ - y^) . 



I = (1,0,2.), I = (0, !,-%), 



i 


j 


k 




1 





2x 







1 


-2y 





dr ^ dr 
dx dy 

A normal vector to the surface at {X, Y, Z) is then {—2X, 2Y, 1) and we see that the equation of the tangent 
plane is 

r-(-2X,2y,l) = {X,Y,X^ -Y'^) ■ {-2X,2Y,1) 

= -2X^ + 2Y^ + - y2 

= Y'^ -X"^ = -Z 



or equivalently 



2Xx - 2Yy ^z = Z. 



10.2 The Gradient Vector 



In practice though, for the curves and surfaces we met in the last section, there will be no need to go through 
the rather laborious task of parametrising surfaces to find normals. This is because all these curves and surfaces 
we've seen are examples of level sets and the gradient vector is a simple way of finding normals to such surfaces. 
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Definition 147 Given a scalar function / : K" ^ K, whose partial derivatives all exist, the gradient vector 

V/ or grad/ 

is defined as 

The symbol V is pronounced "grad " usually, but also as "del " or "nabla ". 
Example 148 Let 

f {x, y, z) = 2xy'^ + ze^ + yz. 

Then 

V/ = (22/2 + 4x2/ + z, + y) . 

Example 149 Let 

9{x,y,z) = x'^ +y'^ + z'^. 

Then 

Vg=i2x,2y,2z). 
Note that Vg is normal to the spheres g = const. 

Example 150 Consider the vector field 

V {x,y,z) — (2xy + z cos (zx) , x? + e^~^, x cos {zx') — e^~^) . 
Find all the scalar functions f {x, y, z) such that v = V/. 
Solution. For any such / we have 

— = 2xy + z cos [zx] 
ox 

and hence 

/ {x, y, z) = x^y + sin {zx) + g {y, z) , 
for some function g of two variables. Now df jdy =^ x^ ^- e^~^ and differentiating the above gives 

oy 

so that 

g{y,z) = e^-' + h{z). 
Finally df jdz = a; cos (zx) — e^~^ and differentiating the above with respect to z gives 

X cos [zx) — e^~^ + /i' [z) = X cos (zx) — e^~^ 

so that h' (z) = and /i is a constant. So the given function / has the form 

/ (x, y, z) = x^y + sin (zx) + e^^^ + const. 

■ 

The formula for V in planar polar co-ordinates is as foUows:- 
Proposition 151 (Planar Polar Co-ordinates) Let 

er = {cos9,sm9) , ee = {— sm6,cos9) . 
These are perpendicular unit vectors, pointing along the curves r = const, and 6 = const. Further 

^ , 90 ld(j) 
or r oO 
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Proof. It is easy to check that 



e^. • = cos^ 9 + sin^ ^ = 1, 
e0 ■ eg = sin^ 6 + cos^ = 1, 
er ■ eg = — sin 6 cos ^ + sin ^ cos ^ = 0. 

The hne 9 = c\s parametrised by 9 (r) = (r cos c, r sin c) and we have 9' (r) = e,.; similarly r {9) = {a cos 9, a sin 9) 
and we see r' (9) = aeg. Recall that the relationship between r, 9 and x, y are given by 

r~o o X ^ dr y . . 

r = va; +v, 7^ = — = cos 9, — = — = sm 9, 

^ ^ dx dy ^/^^Ty^ 

y 39 —y — sin0 39 x cos 9 
9 = tan — — — — 



and so 



X ' 3x + r ' dy x"^ +y'^ 



d(t>. d(f). 

dr d^d9\ .^^ / d^dr^ ^ d^d9_'' 



dr dx 89 dx J \ dr dy d9 dy 

dd) , . d(h { — sm9. cos 9. 

^(cos^i + sm^j) + -(^— 1+— J 

dcp 

dr r d9 



Definition 152 Let / : K" — > K 6e a differentiable scalar function and let u be a unit vector. Then the 
directional derivative of f at a in the direction u equals 



lim 



/(a+iu)-/(a) 



t^o \ t 

This is the rate of change of the function f at a. in the direction u. 

Example 153 Let f {x, y, z) = x^y — and let a = (1, 1, 1) . Calculate the directional derivative of f at a in 
the direction u = {ui,U2,U3). In what direction does f increase most rapidly? 



Solution. Now 



/(a+fu)-/(a) 



{l+tUif{l+tU2)-{l + tU3f 



t t 

= (2ui +U2- 2U3) + (ul + 2U1M2) t + u\u2t'^ 

2ui +U2 — 2u3 as f ^ 0. 

This is the directional derivative asked for. What is the largest this can be as we vary u over all possible unit 
vectors? Well in the direction (|, 5, the directional derivative is 

4 1 4 

3 + 3 + 3=^- 

On the other hand 

2ui +U2- 2u3 = (2, 1, -2) • u = 3 |u| cos 6* = 3 cos 9 
(where 9 is the angle between u and (2, —1, 2)) takes a maximum of 3 when 9 — and u = (2/3, 1/3, —2/3) . ■ 

Proposition 154 The directional derivative of a function f at the point a in the direction u equals V/ (a) • u 
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Proof. Let 

F{t) = f (a+tu) = / (ai + tui, . . . , a„ + tUn) ■ 

Then 



(-/(a+tu)-/(a)^^ ^ ^^^^^ (^FW-F(0) ^ ^ . 



Now, by the chain rule, 



t=o 



5/ da;i df dx„ 

= v^) ^ ^ "a — y^) ~rr 

dx\ dt ax„ At 

= ^(a)n, + ... + — (a)n„ 
= V/(a)-u. 

■ 

Corollary 155 The rate of change of f is greatest in the direction V/, that is when u = V// |V/| . 

Definition 156 A level set of a function f : ^ M. is a set of points 

{{x,y,z)eM.^ : f{x,y,z) = c} 

where c is a constant. For suitably "nice" functions f and constants c the level set is a surface in M^. Note that 
all the quadrics given in Example I40 are level sets. 

Proposition 157 Given a surface 5 C with equation f{x,y,z) = c and a point p £ S then V/ (p) is 
normal to S at p equals. 

Proof. Let u and v be co-ordinates near p and r : («, (■) r (u, v) be a parametrisation of part of S. Recall 
that the normal to 5 at p is in the direction 

dr ^ dr 
du dv' 

Note / ((r (u, v))) = c and so df /du = df /dv = 0. If we write r {u, v) = {x {u, v) ,y{u,v),z {u, v)) then we see 
that 

-ft - 

df dx df dy df dz 
dx du dy du dz du 
df 

= — [by the chain rule] 



[df 


dl 


df\ 


fdx 


dy 




\dx 


dy 


dz J 




' du 


du) 



du 
= 

as / is constant as a function of u and v. Similarly V/ • dr/dv — and hence V/ is in the direction of 
dr/du A dr/dv and so normal to the surface. ■ 

Example 158 In Example I46 we determined the normal at {X, Y, Z) to the hyperbolic paraboloid z = x^ — y^. 
If we introduce the function 

f{x,y,z) =x^ -y^ -z 

then we can know the normal is 

V/ iX,Y,Z)^{2X,-2Y,-l) 
which is find the normal to vector we found previously. 
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Example 159 Find the point on the ellipsoid 



y'^ 2 



which is closest to the plane x + 2y + z = IQ. 



Solution. The normal to the plane is parallel to (1, 2, 1) everywhere. The point on the ellipsoid which is closest 
will also have normal (1, 2, 1). If we set 



then the gradient equals 



/(x.J,,2) = ^ + ^+z'-l, 



If this is parallel to (1, 2, 1) then we x = 2A, y = 9A, z = A/2 for some A. This point lies on the ellipsoid when 

(2Af (9Af 
4 9 I 2/ 



giving 41A /4 = 1, and so A = 2/-\/4T. The closest point then is 

4 18 1 



41 V41 V41 



Example 160 The temperature T in is given by 

T{x,y,z) =x + y'^ -z'^. 

Given that heat flows in the direction of—VT, describe the curve along which heat moves from the point (1, 1, 1). 
Solution. We have 

-VT = {-l,~2y,2z) , 

which is the direction in which heat flows. If we parametrise the flow of the heat (say by arc-length s) as 

r (s) — {x (s) ,y{s),z (s)) then we have 

y (s) z (s) 

as {x',y',z') and (— 1,— 2y, 2^;) are parallel vectors. Integrating the last two equations we get 

Iny(s) = 2a; (s)- 2 
ln2;(s) = -2a;(s) + 2. 

Hence the equation of the heat's path is 

2x = In J/ + 2 = 2 - In 2:, 

moving along the direction of (1, 2, —2) from (1, 1,1). ■ 

As you might expect V satisfies a selection of product, quotient and chain rules: 

Proposition 161 Let f and g be functions ofx,y,z. Then 

ft) V(/5)= fVg + gVf 

fit) V (D = n/"-iV/ 



9 J g2 



(iv) V(/o5)(x)= /'(5(x))V5(x) 
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Proof. See Exercise 201 for (i), (ii), (iii). The proof of (iv) follows from the chain rule:- 

V(/off)(x) = (^/(5(x)),^/(5(x)),^/(5(x))) 

= (^f'{g (x)) If W , / ' (5 (x)) ^ (x) , / ' (5 (x)) g (x) 
= /'(5(x))V5(x) 



Example 162 * There are two important operations related to grad V. For a vector function v = {vi,V2,V3) 
we define 



i 


j 


k 


a_ 


d_ 


d 


dx 


dy 


dz 


Vi 


V2 


V3 



This is also written curlv. 
Secondly 



V Av = 



„ dvi dv2 dv3 

V • V = — =- H H 

dx dy dz 



which is also written divv. 

Example 163 Perhaps unsurprisingly these operations satisfy the identities 

V-(VAv) = 
V A V/ = 

for any vector function v and scalar function f. To prove the latter we see 

VAV/ = 



i 


j 


k 


_d_ 


d_ 


d 


dx 


dy 


dz 


dl 


Of 


df 


dx 


dy 


dz 



The Laplacian V given by 



can also be rewritten as 



{fzy fyz)^'^{fxz fzx)3'^ifyx /a:j/) ^ — 0. 



^2, d^f d^f 
y^f = — - H H — - 

dx^ dy"^ dz"^ 



VV = V-V/= div(grad/) 



which explains its notation as V . 

Example 164 * Recall in Example 150 we introduced 

V (x, y, z) = {2xy + z cos {zx) , x^ + e^~^, x cos {zx) — e^~^) 
and noted that v = V/ where 

f {x, y, z) — x^y + sin {zx) + e^~^. 

From the previous example it follows that curlv = 0. However, such scalars don't always exist; for example 
there is no f such that V/ = w where 

w = {y,z,x). 



Note that 



V A W : 



i j k 

dx dy dz 
y Z X 



(-1,-1,-1)7^0. 
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In fact, the converse of\7 A V/ = does hold in K^. That is, i/F is a vector-valued function on M.^ such that 
curlF = then there exists a scalar function </>, known as a potential such that F = V(/>. 
The potential cj) can be defined as 



'(P) 



/' 

^0 



F(tp) -pdi. 



So in the case ofv this equals 
-1 



< [2t^xy + tz cos (t^zx)] x + 



—a; y+ - sm [t zx) + —x y + 

2 



txcos (fzx) -e*(^-^)] 



dt 



ye 



t(v- 



y- z 



+ - sin (t^zx) 



ze 



t{v-z) 



y- z 



X y + sin (2:2;) + ^. 



10.3 Exercises 



Exercise 189 * Find the equations in Cartesian co-ordinates of: 

(i) the ellipse with foci at (±2,0) which passes through (0, 1); 

(a) the hyperbola with asymptotes y = ±2x and directrices a; = ±1/ 

(Hi) the ellipse consisting of all points P such that \AP\ + \BP\ = 10, where A is (3,0) and B is (—3,0); 
(iv) the parabola with directrix x + y =1 and focus (—1, —1). 

Exercise 190 * Let i = (1,0) and'} = (0,1) denote the canonical basis ofM?, and set ei = (i+j)/-\/2 and 
e2 = (j — i)/-\/2. Show that {ei, e2} is an orthonormal basis for M?. 

Given X,Y gW determine x,y gM. such that Xei + Ye2 = xi-\-yj. 

Deduce that the curve x^ -\- xy + y"^ = 1 is an ellipse and find its area. 

Exercise 191 Show that the point p{t) = {a sec t,b tan t) lies on the hyperbola H given by x^/a'^ — y^ jl? = 1. 
Give a domain S CR for t so that p : S ^ H parametrises the hyperbola. 

Write down the equation of the tangent line to the hyperbola x^/a"^ — y'^/b^ = 1 at the point {asect, 6tant). 

Determine the co-ordinates of the points P and Q where this tangent line meets the asymptotes of the hy- 
perbola, and show that the area of the triangle OPQ is independent oft. 

Exercise 192 * The conic C is formed by intersecting the double cone x'^+y'^ = z^ with the plane x+y + z = 1. 
Show that the point with position vector 

r{t) = (1 + (secf- tant)/A/2, 1 + (sec i + tan t) /a/2, -1 - \/2sect) 

lies on C. 

(b) Show that the vectors ei = (l/-\/6, l/v^, — -^2/3) and e2 = (— 1/-\/2, l/-\/2, 0) are of unit length, are 
perpendicular to one another and are parallel to the plane x + y + z = 1. Show further that 

r{t) = (1, 1, —1) + (aseci)ei + (6tant)e2 

where a and b are positive numbers to be determined. 

(c) Show that C has eccentricity 2/-\/3, has foci (1, 1, —1) ± 2ei and that the directrices, in parametric form, 
are (1, 1, -1) ± 3ei/2 + Ae2. 

Exercise 193 * A straight line L through the focus F of a conic C meets C in two points A and B. Show that 
the quantity 

1 1 

\AF\ ~^ \BF\ 

is independent of the choice of L. [Hint: this question is best done using polar co-ordinates.] What is this 
quantity for the parabola y"^ = Aax ? 
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Exercise 194 * Find parametrisations for the following surfaces: 

i) the cylinder + = a^, a > 0, 

ii) the paraboloid z = , 

Hi) the ellipsoid x'^ / + ip /b'^ + / = 1, a, 6, c > 0, 

iv) the hyperholoid of one sheet x^/a^ + — x^/c^ = 1, a,b,c > 0, 

v) the surface generated by rotating the curve y — f{x) about the x-axis. 

Exercise 195 * A torus in has parametrisation 

r{u,v) = ((a + bcosv) cosu, {a + bcosv) sin u, 6 sin t;) 

for u,v E (0, 2n). Write down an equation for the torus in terms of x, y, z. 
Calculate dr/du and dr/dv. The torus' surface area is defined by 

vl = / / \dr/du A dr/dv\ dudv. 
Jo Jo 

Show that the torus has area Air'^ab. 

Exercise 196 * Let r {u, v) = s {a, b) be two different parametrisations for the same surface. Use the chain 
rule to show that r„ A r„ and Sa A Sb are parallel — hence the definition of a normal does not depend on the 
choice of parametrisation. 

Exercise 197 Show that there is no function f {x, y, z) such that V/ {x, y, z) = {y, z, x) . 
Exercise 198 Find the gradient V/ when (i) f {x,y,z) = zsiny — xz; (ii) f {x, y, z) = ze^ cosy. 
Exercise 199 Suppose that a hill has equation 

oo 1 2 -'-2 

where z is the height above sea-level in metres. Sketch a contour map (i.e. draw on the same axes a set of 
curves z = constant) with various contours shown. 

If you start at the point (3, 2) and walk in the direction i + j are you going uphill or downhill? In what 
direction (in three dimensions!) are you actually travelling? If you are travelling at v ms'^ what is your 
vertical speed? 

Exercise 200 Suppose that the temperature T in the xy-plane is given by 

T{x,y) = xy-x. 

(a) Sketch, on the sam,e axes, a few isothermal curves (curves of constant temperature). 

(b) Find the direction in which the temperature changes most rapidly at (2, 2) . Find the directional derivative 
ofT at (2, 2) in the direction 3i — 4j. 

(c) Heat flows in the direction — VT (perpendicular to the isothermals). Express this as a differential equation 
in y and x and solve it to say how heat would floiv from (2, 2) . 

Exercise 201 Let f and g be functions of x, y,z. Show that 



(^) 


V ifg) : 


= /V,9 + ,9V/, 


(ii) 


V(/") 




(Hi) 




gV/ - fVg 
9' 



(iv) The Laplacian is defined by 

d^f d^f 

V / = — - H H -. 

dx"^ dy"^ dz^ 



Show that 

(v) Let a. be a constant vector and let r = (x, y, z). Calculate V (a • r) 



V' ifg) ^fV'g + 2 (V/) • (V5) + gVV- 
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Exercise 202 Let r = {x, y, z) and r = |r| = •\/ir^"+"y2Tjr^_ Show that for functions / : M — > K and : — > M 

V(/(g(r))) = /'(5(r))V5(r). 
Using this result (with g (r) = r), find VF when 

(i) F{r)=r, (ii) F(r)=lnr, (Hi) F{r) = ^. 

Exercise 203 Show that in terms of spherical polar co-ordinates 

df 1 df Idf 

or r sm d o<p r aO 

where er,eg,erj, are as given in Exercise 162. 

Exercise 204 * Given orthogonal curvilinear co-ordinates in the plane (see Exercise 164) show that 

hi OXi /l2 0x2 

Exercise 205 Let f {x,y) = x^y^. What is V/? Determine 

t^o t 

where r = (1,1) and n = (a, h) is a unit vector so that 0^+6^ = 1. What is the maximum value of the limit that 

can he obtained by varying the vector n? 

Exercise 206 Find a unit vector normal to the surface xy"^ + 2yz = A at the point (—2,2,3) . 

Exercise 207 Find the angle between the sphere x^ + y^ + z"^ = 2 and the cylinder x'^ + y^ = 1 at any point 
where they intersect. 
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11. CRITICAL POINTS 



In this final chapter we use the gradient vector to find the critical points (or stationary points) of a real function. 
The idea of a critical point is the multivariate generalization of a stationary point of a one-variable function, 
but critical points in several variables can arise with a more complicated variety. Consider the function 

k {x, y) — sin^ x + sin^ y — 4 sin^ x sin^ y (11-1) 

and its graph z — k{x,y) (see below) for the range ^ x, y ^ tt. 




The function has critical points of different kinds as we shall see. The most obvious, and one easily defined, is 
the minimum in the middle at (7r/2, n/2) . 

Definition 165 Let f : M" ^ M 6e a real-valued function. Then f has a local minimum at p — {pi, . . . ,Pn) 
if there exists £ > such that 

f (x) ^ f (p) whenever \xi — Pi\ < s for each i. 

That is, if there is a small n-dimensional cube (or equivalently n- dimensional hall) about p such that f gets no 
smaller than at p in this cube. 

Similarly we define f to have a local maximum at p = (pi, . . . ,p„) if there exists e > such that 

f (x) ^ / (p) whenever \xi — Pi\ < s for each i. 

It may, of course, be the case that / gets greater/smaller at other points in M". But we can define: 

Definition 166 Let f : M" -^M. be a real-valued function. Then f has a global minimum at p = {pi, . . . , p„) 
if 

f{^)^f{p) for all^(iW-, 
and f has a global maximum at p — {pi, . . . ,Pn) if 

/(x) ^ /(p) for allyiE W\ 

Suppose now that n = 2 and that / has partial derivatives of all orders with respect to x and y. If / has a 
local minimum at p then in whatever direction u — {ui,U2) we move from p the function cannot decrease. If 
we take the cross-section of the graph z ~ f {x, y) by the vertical plane through p and parallel to the vector u 
we would have a graph in the te-plane of 

■2 = /(Pl +tUi,P2 +tU2) 
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and we know that at such a minimum dz/dt = 0. Below are graphs of ^; = fc (pi + tui,p2 + tu2) for the function 
(11.1) with p = (7r/2, 7r/2) and different values of u. 





u = (l,0) 
Recall from the chain rule that 



-0J5 O.S 

u = {I/V2, 1/V2) 



dt 



df 



df 



Ul^ (p) + U2^ (P) = U • V/ (p) 



t=0 



dx 



dy 



is the directional derivative of / at p in the direction u. As this is true for all vectors u then in particular it 
is true when u = V/ (p), and hence it follows that |V/ (p)|^ = and so V/ (p) = 0. This motivates our next 
definition: 

Definition 167 Let f : ^ W be a real-valued function which is differentiable with respect to each variable. 
A point p is said to be a critical point and f (p) is said to be a critical value if 



dxi 



dXr,, 



= 0. 



It's clear from the above that local maxima and minima are critical points but there are other types. 
Example 168 Determine the critical points of 

/ \ A. A. 

k (a;, y) = sin a; + sin 2/ — 4 sin x sin y. 

Solution. We have 

dk/dx = 4sin'^ xcosa; — 8sina;cosa;sin^ y = sin2a; (— 1 — cos2a; + 2cos2y) , 
dk/dy = 4sin^ y cosy — 8sin^ xsiny cosy. = sin2y (— 1 — cos2y + 2cos2a;) . 

There are various cases to consider. 

(i) sin2x = and sin2y = 0; so {x,y) = {mw/2,mr/2) for integers m,n G Z. 

(ii) sin2.T = 0, cos 2.x = 1 and cos2y = —1. 

(iii) sin2y — 0, cos2y = 1 and cos 2a; = —1. 

(iv) and in the final case 



■ cos 2x + 2 cos 2y = 1, 



■ cos2y + 2cos2x = 1, 



so that cos 2a; = cos2y = 1. Note that these last three cases are already included in case (i). ■ 

The function k {x, y) is period with period tt independently in the two variables. If we sketch the graph of 

z = sin* x + sin* y — 4 sin^ x sin^ y for the range ^ a;, y < tt 

we see that the four critical points in this range 

(0,0),(^/2,0),(0,7r/2),(7r/2,7r/2) 

are quite different in nature. Let's consider them separately :- 
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• (0,0): At this point k is 0. We can get a sense of how k behaves near (0,0) by moving off in various 
directions from (0,0). If we move along the a;-axis we see 

k {t, 0) = sin^ t, 

a cross-section of the graph which is a minimum. However if we move along the y = x line we note 

k{t,t) = -2sin^t, 

a cross-section which yields a maximum. So depending on the direction we move from (0, 0) the function 
may increase or decrease. In this case (0, 0) is an example of a saddle point. 

• {n/2, 0): At this point k is 1. If we move along the x-axis we see 

k{n/2 + t,0) =cos'^t, 

a cross-section of the graph which is a maximum. If we move parallel to the y-axis we also note 

k (7r/2, t) = l + sin^ t - 4 sin^ t, 

which is also a maximum as 

^fc(7r/2,t) = Usm^tcos^t- Asm^t + d,s.m^t-d,cos^t = -8 < when t = 0. 

In fact we shall see that however we proceed from {n/2, 0) the function decreases. By symmetry the same 
is true at (0, 7r/2) 

• (7r/2,7r/2): At this point fc is — 2. If we move along the a;-axis we see 

k (7r/2 + t,TT/2) = cos^ i + 1 - 4 cos^ t 

which is a cross-sectional minimum as 
d^ 

-r^k (n/2 + 1,^/2) = 12cos^isin^f-4cos'^i-h8cos^f- Ssin^t = 4 > when t = 0. 
dt^ 

In fact we shall see that however we proceed from {n/2, n/2) the function increases. 

So suppose we are in the general situation of having a graph z = g{x,y) with a critical point at p = (pi,p2) . 
Consider the function 

h (t) = g {pi + uit,p2 + U2t) 

where u = (ui, W2) is a unit vector. A vertical planar cross-section of z through p and parallel to u would yield 
a graph oi z = h{t). By the second-order chain rule we have 



{uif Quu (p) + 2uiU2guv (p) + ("2)^ 9vv (p) • (11-2) 



t=o 



We are interested in the sign of (11.2) as we consider all possible unit vectors u. In the case of a maximum 
(resp. minimum) the above should always be negative (resp. positive) whilst in the case of a saddle point both 
signs should be possible. If W2 7^ we can rearrange (11.2) as 



— ] 3«« (p) + 2 ( — ) guv (p) + 9vv (p) 

U2 J \U2 



then we have (^2)^ > and the second bracket is a quadratic in u\/u2- If we consider the discriminant (i.e. 
"6^ — 4ac") we see it equals 

4 {guv {vf - 9uu (p) 9vv (p)) 

and so the second bracket: 
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• is always positive if (p) < Quu (p) 9vv (p) and Quu (p) > (the second bracket represents a U-shaped 
parabola with no real roots); 

• is always negative if guv (p)^ < guu (p) dw (p) and guu (p) < (the second bracket represents a fl-shaped 
parabola with no real roots); 

• takes positive and negative values if g,,,, (p)^ > guu (p) 9vv (p) (the second bracket represents a parabola 
which crosses the horizontal axis twice). 

Note that if U2 = then we have a further cross-sectional minimum or maximum to add to the first or second 
case depending on whether g^u (p) is positive or negative respectively. 

Definition 169 Let p gM? be a critical point of the function g : ^ R. We say that p is a non-degenerate 
critical point if 

ijuv 

Remark 170 It is beyond the scope of this chapter to discuss degenerate critical points at any length though 
examples of such are discussed in Examples 173 and 175. 

So in the prequel to Definition 169 we showed: 

Theorem 171 If p is a non-degnerate critical point of the function p : — > M then: 
(i) p is a maximum if guv {pf < guu (p) gw (p) and guu (p) < 
(a) p is a minimum if guv (p)^ < guu (p) Qw (p) and guu (p) > 
(Hi) p is a saddle point if g^ {pf > guu (p) gw (p)- 

Example 172 Find the critical points of the function 

f{x,y)=x'^+2xy-y^+y^, 

and classify them as maxima, minima or saddle points. 

Solution. Firstly the simultaneous equations 

f, = 2x + 2y = 0, fy = 2x-2y + iy^ = 0, 

imply = 4y and hence the function has two critical points (0,0) and (—4/3,4/3) . Now 

fxx — 2, fxy — 2, fyy = 6y — 2. 

So, to classify the critical points we note: 



Point 






■fijij 


/j'.i' ■/;/,'/ fxij 


Tyi)(> 


(0,0) 


2 


2 


-2 


-8 < 


saddle, 


(-4/3,4/3) 


2 


2 


10 


16 > 


minimum. 



Example 173 Returning to the function 

k {x, y) = sin* x + sin* y — 4 sin^ x sin^ y 
from (11-1) we were investigating its critical points at (0, 0) , (7r/2, 0) and {-k/2,-k/2) . Note that 



kx = Isin'^xcosx — Ssinxcosxsin^y, 
ky = 4 sin^ y cos y — 8 sin y cos y sin^ x, 

A. 

kxx = 12 sin a; cos a; — 4 sin a; — 8 cos a; sin y + Ssin a; sin y, 

kxy = — 16 sin y cosy sin x cos a;, 

kyy = 12sin^ y cos^ y — 4sin* y — 8cos^ ysin^ X + 8sin^ ysin^ X. 
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So, classifying the above three critical points we see: 



Point 


h 


h 


h 


r^xxi^yy f^xy 


Type 


(0,0) 














degenerate 


U/2,0) 


-4 





-8 


32 > 


maximum, 


(7r/2,V2) 


4 





4 


16 > 


minimum.. 



Remark 174 Even though (0,0) is degenerate, we can easily eUminate it as a possible maximum or minimum 
by our earlier analysis after Example 168 where we showed that the function both increased and decreased in 
different directions from (0, 0) . 

Example 175 *Find an classify the critical points of 

f{x,y)=x^-y'^, g{x,y) = x'^y + xy'^, h{x,y) = x'^ + 2x'^y'^ + 2y'^. 

Solution, (i) We have fx = 2x = and fy = — = only at (0, 0). At (0, 0) we have f^x = 2, fxy = 0, fyy = 
and so (0, 0) is degenerate. However we can easily see that moving along the a;-axis we perceive (0, 0) as a 
minimum as / {x, 0) — x^, whilst if we move along the j/-axis we perceive (0, 0) as a maximum as / (0, y) = — y^. 
So (0, 0) is a saddle point. 

(ii) Note gx = 2xy + y^ =0 and gy = x^ + 2xy = only at (0, 0) . At (0, 0) we have g^x = 9xy = 9yy = and 
so (0, 0) is degenerate. But it is very easy to see that g {x, y) takes both positive and negative values near 0. 
As g (x, y) = xy {x + y) then g (./;, y) > in the region a; > 0, y > 0, a: + r/ > (say at points (e, e) where e > 0, 
which can be arbitrarily close to the origin), but g {x, j/) < in the region x>Q,y<Q,x-\-y>Q (say at points 
(2e, —e) where e > 0). 

(iii) Note h.x = ix^ + ixy"^ = and hy = ix'^y + Sy^ = only at (0, 0). At (0, 0) we have h^x = h^y = hyy = 
and so (0, 0) is degenerate. We can, however, easily see that 

h {x, y) = (a;2 +y-^f +y'i^o for all {x, y) € 

and so (0, 0) is a minimum, is in fact a global minimum of h (.x, y) . ■ 

Recall that when asked to find the extreme values of a function on a region of the real line (e.g. an interval) 
the maxima and minima may arise in one of two ways: 

• at internal extrema — this is when the function takes its maximum or minimum at a point in (a, b) and 
in this case /' (a;) = at such a point; 

• at external extrema — this is when the function takes its maximum or minimum at either a; = a or a; = 6 

and it need no longer be the case that / ' (a;) = 0. 

By way of example consider the function 

/ (a;) = cos a; for 1 ^ a; ^ 4. 

It is clear, from the graph below, that the function takes its maximum (in this interval) at a; = 7r/2 which is an 
internal maximum, and takes its minimum at a; = 4 which is an external maximum. 




y = cos a; for 1 < a; < 4. 

There are various ways to approach the problem below, including purely geometric ones, but we will treat 
this as an extremum problem of a two variable problem. 
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Example 176 Given a triangle with angles A,B,C, none of which is obtuse, show that 

3\/3 

2 < sinA + sinS + sinC < 

Solution. At first glance this may seem like a three variable problem but of course A + B + C = tt and so in 

fact all possibilities are determined by A and B alone. Now {A, B) cannot taJse all possible values in the plane; 
rather it must be the case that 



0<A< 

and asC = 7r — A— Bis also acute it follows that 

TT 



0<B < 



2' 



kA + Bktt. 
z 

These three inequalities mean that we only need to treat those triangles {A, B) which lie in the interior of 
which is a triangular region of the AB-plame sketched in the diagram below. 




O.ZS 0.5 0.1S' 1 1.25 l.S 1.75 2 

So our problem then is to find the maximum and minimum value taken 

/ {A, B) = sin A + sinS + sin {-k - A - B) = smA + sinB + sin {A + B) 

in the region R. 

• InternEil Extrema: At the internal extrema fA = fB=Q and so we have 



df 



: cos A + COS (A + B) = 0, 



df 



cosB + cos{A + B) = 0. 



dA ' ^ ' ' ' dB 

These two equations hold when 

COS A = COS S = - cos {A + B) ^ cos [n - A - B) 

which means that A = B = 'k — A — Bior the allowed values of A, B. In short, it means that the triangle 
is equilateral. For such triangles / = 3a/3/2, which is the maximum value that / can take inside R. This 
is the only internal extremum. 

• External Extrema: instead of having just two endpoints to check, as in the one-variable case, our 
boundary now comprises three sides of a triangle. These are given by: 

A = 7r/2, < S < 7r/2, 
B = 7r/2, < A < 7r/2, 
A + B = 7r/2, 0^A,B^tt/2. 
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We will treat the first side and set A = 7r/2. Then 



= sin-+sinB + sin(^-+Bj 
= 1 + sin B + cos B 

= i+v2cos(^B -'^y 

We can see then that, as B varies over [0, 7r/2] then / is minimal at B = and tt/2 at which points / = 2 
and has a maximum of 1 + y/2 on this side (and this is less than 3-\/3/2. Note that when A = 7r/2 and 
-B = or 7r/2 we don't really have a triangle, in the normal sense of the word — rather we have two 
parallel lines which meet at infinity. 

If wc now wished wc could deal with the remaining two sides similarly, but it is easier to note that they 
will be the same by the symmetry of the situation — having a right angle at A will result in triangles 
congruent to those arising from having a right angle at B or C. 

So finally we have our maximum and minimum for /, and noting that the minimum is not actually achieved 
by a proper triangle we have 

3 v^3 

2 < sin A + sinS + sinC < 
for any triangle with no obtuse angles. 



11.1 Exercises 

Exercise 208 Find and classify the critical points of f (x, y) = — 2x^ + + 6xy. 
Exercise 209 Find and classify the critical points of f {x, y) — x^ +y^ + 3xy. 
Exercise 210 Find and classify the critical points of f {x, y) = xy {x^ + — l) . 
Exercise 211 Find and classify the critical points of f {x, y) = sin^ x + sin^ y — cos^ x cos^ y. 
Exercise 212 Find the stationary values of 

f {x, y)=x^ +x'^ + 2axy + + 2ax + 2y 

(a) when a > 1, (b) when a < 1, 

and classify them as maxima, minima or saddle points. 

Exercise 213 Distinguishing cases determine the number of critical points of 

f {x, y) = x^ - 12xy + 48x + cy"^, 

where c e M. 

Exercise 214 Find and classify the critical points of f (a;, y) = 5x + 15y — (a;^ + y^)^ . 

Exercise 215 By making a change to polar co-ordinates, find the maximum and minimum values of x^ + y^ 
on the curve with equation 

x^ + 2a;y + 2y2 = 8. 

Exercise 216 By considering V/ where f {x, y) = x^ /A + y^ , or otherwise, find the shortest distance from the 
line X + y = 3 to the ellipse x^/A + y^ = 1. 
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Exercise 217 By using appropriate parameters find the maximum of f {x, y, z) = x^ +y^ + 

(i) subject to x^ + y'^ + z"^ = 1; 

(ii) subject to x^ + y"^ + z'^ = 1 and x + y + z = Q. 

Exercise 218 Find the greatest value of f {x,y) = 1 + xy + yz on the unit sphere x'^ +y^ + z"^ = 1. 
Exercise 219 * Show that if x,y,z > then 



a + + c 



Deduce for a,b,c> that 

ab^c^ < 108 

Exercise 220 * (a) Let a,b,c € R. Show that f {x,y) = ax^ + 2bxy + cy^ has a critical point at the origin. 
Show further that 

f{x,y) = {x,y)A[^ 



where 

a b 



b 



c 



(b) Show that the matrix A has two real eigenvalues Ai,A2 and that there are orthonormal vectors vi and V2 
such that 

Awi = AjVj fori = 1,2. 

(c) Show that the origin is a minimum, maximum or saddle respectively if Xi, X2 are both positive, both negative 
or of mixed signs. 

Exercise 221 * The point xq = {xo,yo,Zo) is a maximum or minimum of a smooth function / (x) , where 
X = {x, y, z), subject to the constraint that x lies on the surface g (x) = 0. Let r [t) = {x [t) ,y{t),z {t)) be a 
smooth curve which lies in the surface 5 (x) = and such that r (0) = xq. 

Write F (t) = f {r{t)). Explain why F' (0) = for all such curves and deduce that V/(xo) is normal to 
g (x) =0 at Xq. Hence show that there is a constant A such that 

V/(xo) = AV5(xo). 

This is known as the method of Lagrangian multipliers. 

Exercise 222 * Use Lagrangian multipliers to find the shortest distance from the origin to the surface 

X = yz + 10. 

Exercise 223 * Use Lagrangian multipliers to find the shortest distance from the origin to the line of intersec- 
tion of the planes 2x + y — z = 1 and x — y + z = 2. Repeat the calculation using geometric methods. 

Exercise 224 * Use Lagrangian multipliers to find the largest triangle that can be inscribed in the ellipse 
x^a'^+y^b'' = 1. 
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